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Claim 81: An isolated nucleic acid molecule which encodes a cancer associated antigen, the 
complementary sequence of which hybridizes, in whole or in part, to at least one of a nucleic acid 
molecule, the nucleotide sequence consists of the nucleotide sequence of SEQ ID NO: 15, SEQ 
ID NO: 22, or SEQ ID NO: 26. 

Claim 82: The isolated nucleic acid molecule of claim 81, the nucleotide sequence of which 
comprises the nucleotide sequence of SEQ ID NO: 15, SEQ ID NO: 22, or SEQ ID NO: 26. 
Claim 83 : The isolated nucleic acid molecule of claim 82, comprising the nucleotide 
sequence of SEQ ID NO: 15. 

Claim 84: The isolated nucleic acid molecule of claim 82, comprising the nucleotide 
sequence of SEQ ID NO: 22. 

Claim 85 : The isolated nucleic acid molecule of claim 82, comprising the nucleotide 
sequence of SEQ ID NO: 26. 

Claim 86: Expression vector comprising the isolated nucleic acid molecule of claim 8 1 , 
operably linked to a promoter. 

Claim 87: Eukaryotic cell line or prokaryotic cell strain, transformed or transfected with the 
expression vector of claim 86. 

Claim 88: Eukaryotic cell line or prokaryotic cell strain, transformed or transfected with the 
isolated nucleic acid molecule of claim 81. 

Claim 89: Isolated cancer associated antigen comprising all or part of the amino acid 
sequence encoded by SEQ ID NO: 15, 22 or 26. 
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Claim 90: The eukaryotic cell line or prokaryotic cell strain of claim 88, wherein said cell 

line is also transfected with a nucleic acid molecule coding for a cytokine. 

Claim 91: The eukaryotic cell line or prokaryotic cell strain of claim 90, wherein said cell 

line is further transfected by a nucleic acid molecule coding for an MHC molecule. 

Claim 92: The eukaryotic cell line or prokaryotic cell strain of claim 90, wherein said 

cytokine is an interleukin. 

Claim 93: The eukaryotic cell line or prokaryotic cell strain of claim 92, wherein said 
interleukin is IL-2, IL-4 or IL-12. 

Claim 94: The eukaryotic cell line or prokaryotic cell strain of claim 88, wherein said cell 
line has been rendered non-proliferative. 

Claim 95: The eukaryotic cell line of claim 88, wherein said cell line is a fibroblast cell line. 
Claim 96: Expression vector comprising a mutated or attenuated virus and the isolated 
nucleic acid molecule of claim 81. 

Claim 97: The expression vector of claim 96, wherein said virus is adenovirus or vaccinia 



virus. 



Claim 98: The expression vector of claim 97, wherein said virus is vaccinia virus. 
Claim 99: The expression vector of claim 97, wherein said virus is adenovirus. 
Claim 100: Expression system useful in transfecting a cell, comprising (i) a first vector 
containing a nucleic acid molecule which codes for the isolated cancer associated antigen of 
claim 89 and (ii) a second vector selected from the group consisting of (a) a vector containing a 
nucleic acid molecule which codes for an MHC or HLA molecule which presents an antigen 



661478.1 



LUD 5615.1-JEL/NDH (09905230) 

derived from said cancer associated antigen and (b) a vector containing a nucleic acid molecule 
which codes for an interleukin. 

Claim 101: Immunogenic composition comprising the isolated cancer antigen of claim 89, 
and apharmaceutically acceptable adjuvant. 

Claim 102: The immunogenic composition of claim 101, wherein said adjuvant is a cytokine, 
a saponin, or GM-CSF. 

Claim 1 03 : Immunogenic composition comprising at least one peptide consisting of an amino 
acid sequence of from 8 to 12 amino acids concatenated to each other in the isolated cancer 
associated cancer antigen of claim 89, and a pharmaceutically acceptable adjuvant. 
Claim 104: The immunogenic composition of claim 103, wherein said adjuvant is a saponin, a 
cytokine, or GM-CSF. 

Claim 105: The immunogenic composition of claim 102, wherein said composition comprises 
a plurality of peptides which complex with a specific MHC molecule. 

Claim 106: Immunogenic composition which comprises at least one expression vector which 
encodes a peptide derived from the amino acid sequence encoded by SEQ ID NO: 15, 22 or 26. 
Claim 107: The immunogenic composition of claim 106, wherein said at least one expression 
vector codes for a plurality of peptides. 

Claim 108: Vaccine useful in treating a subject afflicted with a cancerous condition 
comprising the isolated eukaryotic cell line of claim 88 and a pharmacologically acceptable 
adjuvant. 
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Claim 109: The vaccine of claim 108, wherein said eukaryotic cell line has been rendered 
non-proliferative. 

Claim 110: The vaccine of claim 1 09, wherein said eukaryotic cell line is a human cell line. 
Claim 111: A composition of matter useful in treating a cancerous condition comprising a 
non-proliferative cell line having expressed on its surface a peptide derived from the amino acid 
sequence encoded by SEQ ID NO: 15, 22 or 26. 

Claim 1 12: The composition of matter of claim 111, wherein said cell line is a human cell 
line. 

Claim 113: A composition of matter useful in treating a cancerous condition, comprising (i) a 

peptide derived from the amino acid sequence encoded by SEQ ID NO: 15, 22 or 26, (ii) an 

MHC or HLA molecule, and (iii) a pharmaceutically acceptable carrier. 

Claim 114: Isolated antibody which is specific for the cancer antigen of claim 89. 

Claim 115: The isolated antibody of claim 1 14, wherein said antibody is a monoclonal 

antibody. 

Claim 1 16: Method for screening for cancer in a sample, comprising contacting said sample 
with a nucleic acid molecule which hybridizes to all or part of the molecule encoded by SEQ ID 
NO: 15, 22 or 26 and determining hybridization as an indication of cancer cells in said sample. 
Claim 117: A method for screening for cancer in a sample, comprising contacting said sample 
with the isolated antibody of claim 114, and determining binding of said antibody to a target as 
an indicator of cancer. 
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Claim 118: Method for diagnosing a cancerous condition in a subject, comprising contacting 
an immune reactive cell containing sample of said subject to a cell line transfected with the 
isolated nucleic acid molecule of claim 81, and determining interaction of said transfected cell 
line with said immunoreactive cell, said interaction being indicative of said cancer condition. 
Claim 119: A method for determining regression, progression of onset of a cancerous 
condition comprising monitoring a sample from a patient with said cancerous condition for a 
parameter selected from the group consisting of (i) a protein encoded by SEQ ID NO: 15, 22 or 
26, (ii) a peptide derived from said protein, (iii) cytolytic T cells specific for said peptide and an 
MHC molecule with which it non-covalently complexes, and (iv) antibodies specific for said CT 
protein, wherein amount of said parameter is indicative of progression or regression or onset of 
said cancerous condition. 

Claim 120: The method of claim 119, wherein said sample is a body fluid or exudate. 
Claim 121 : The method of claim 119, wherein said sample is a tissue. 
Claim 1 22 : The method of claim 1 1 9, comprising contacting said sample with an antibody 
which specifically binds with said protein or peptide. 

Claim 123: The method of claim 122, wherein said antibody is labelled with a radioactive 
label or an enzyme. 

Claim 124: The method of claim 122, wherein said antibody is a monoclonal antibody. 
Claim 125: The method of claim 1 19, comprising amplifying RNA which codes for said 
protein. 
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Claim 126: The method of claim 125, wherein said amplifying comprises carrying out 
polymerase chain reaction. 

Claim 1 27 : The method of claim 118, comprising contacting said sample with a nucleic acid 
molecule which specifically hybridizes to a nucleic acid molecule which codes for or expresses 
said protein. 

Claim 128: The method of claim 127, wherein said nucleic acid molecule comprises SEQ ID 
NO: 17, 18, 20,21, 24,25, 28 or 29. 

Claim 1 29 : The method of claim 119, comprising assaying said sample for shed protein. 
Claim 130: The method of claim 1 1 9, comprising assaying said sample for antibodies specific 
for said protein, by contacting said sample with protein. 

Claim 131: Method for diagnosing a cancerous condition comprising assaying a sample taken 
from a subject for an immunoreactive cell specific for a peptide derived from a protein encoded 
by SEQ ID NO: 15, 22 or 26, complexed to an MHC molecule, presence of said immunoreactive 
cell being indicative of said cancerous condition. 

Claim 132: Composition comprising at least one peptide consisting of an amino acid sequence 
of from 8 to 25 amino acids concatenated to each other in the isolated cancer associated antigen 
of claim 89, and a pharmaceutically acceptable adjuvant. 

Claim 133: The composition of claim 132, wherein said adjuvant is a saponin, a cytokine, or 
GM-CSF. 

Claim 134: The composition of claim 132, comprising a plurality of MHC binding peptides. 
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Claim 135: Composition comprising an expression vector which encodes at least one peptide 

consisting of an amino acid sequence of from 8 to 25 amino acids concatenated to each other in 
the isolated cancer associated antigen of claim 89, and pharmaceutically acceptable adjuvant. 
Claim 136: The composition of claim 135, wherein said expression vector encodes a plurality 
of peptides. 

Claim 137: A method for screening for possible presence of a pathological condition, 
comprising assaying a sample from a patient believed to have a pathological condition for 
antibodies specific to at least one of the cancer associated antigens encoded by SEQ ID NO: 15, 
22 or 26, presence of said antibodies being indicative of possible presence of said pathological 
condition. 

Claim 138: The method of claim 137, wherein said pathological condition is cancer. 
Claim 139: The method of claim 137, wherein said cancer is melanoma. 
Claim 140: The method of claim 139, further comprising contacting said sample to a purified 
cancer associated antigen encoded by SEQ ID NO: 15, 22 or 26. 

Claim 141 : A method for screening for possible presence of a pathological condition in a 
subject, comprising assaying a sample taken from said subject for expression of a nucleic acid 
molecule, the nucleotide sequence of which comprises SEQ ID NO: 15, 22 or 26, expression of 
said nucleic acid molecule being indicative of possible presence of said pathological condition. 
Claim 142: The method of claim 141, wherein said pathological condition is cancer. 
Claim 143: The method of claim 141 , comprising determining expression via polymerase 
chain reaction. 
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Claim 144: The method of claim 141, comprising determining expression by contacting said 

sample with at least one of SEQ ID NO: 17, 18, 20, 21, 24, 25, 28, or 29. 

Claim 1 45 : A method for determining regression, progression of onset of a cancerous 

condition comprising monitoring a sample from a patient with said cancerous condition for a 

parameter selected from the group consisting of (i) a cancer associated antigen encoded by SEQ 

ID NO: 15, 22 or 26 (ii) a peptide derived from said cancer associated antigen, (iii) cytolytic T 

cells specific for said peptide and an MHC molecule with which it non-covalently complexes, 

and (iv) antibodies specific for said cancer associated antigen, wherein amount of said parameter 

is indicative of progression or regression or onset of said cancerous condition. 

Claim 146: The method of claim 145, wherein said sample is a body fluid or exudate. 

Claim 147: The method of claim 145, wherein said sample is a tissue. 

Claim 148: The method of claim 145, comprising contacting said sample with an antibody 

which specifically binds with said protein or peptide. 

Claim 149: The method of claim 148, wherein said antibody is labelled with a radioactive 
label or an enzyme. 

Claim 150: The method of claim 148, wherein said antibody is a monoclonal antibody. 
Claim 151 : The method of claim 145, comprising amplifying RNA which codes for said 
protein. 

Claim 152: The method of claim 151, wherein said amplifying comprises carrying out 
polymerase chain reaction. 
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Claim 1 53 : The method of claim 145, comprising contacting said sample with a nucleic acid 
molecule which specifically hybridizes to a nucleic acid molecule which codes for or expresses 
said protein. 

Claim 1 54: The method of claim 145, comprising assaying said sample for shed cancer 
associated antigen. 

Claim 1 55 : The method of claim 145, comprising assaying said sample for antibodies specific 
for said cancer associated antigen, by contacting said sample with said cancer associated antigen. 
Claim 1 56: Method for screening for a cancerous condition comprising assaying a sample 
taken from a subject for an immunoreactive cell specific for a peptide derived from a cancer 
associated antigen encoded by SEQ ID NO: 15, 22 or 26 complexed to an MHC molecule, 
presence of said immunoreactive cell being indicative of said cancerous condition. 
Claim 157: An isolated nucleic acid molecule consisting of a nucleotide sequence defined by 
SEQ ID NO: 15,22 or 26. 
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RELATED APPLICATIONS 

This application is a continuation in part of Serial No. 09/451 ,739 filed November 30, 1999, 
incorporated by reference in its entirety. 
FTELD OF THE INVENTION 

This inventionrelates to antigens associated with cancer, the nucleic acid molecules encoding 
them, as well as the uses of these. 
BACKGROUND AND PRIOR ART 

It is fairly well established that many pathological conditions, such as infections, cancer, 
autoimmune disorders, etc., are characterized by the inappropriate expression of certain molecules. 
These molecules thus serve as "markers" for a particular pathological or abnormal condition. Apart 
from their use as diagnostic "targets", i.e., materials to be identified to diagnose these abnormal 
conditions, the molecules serve as reagents which can be used to generate diagnostic and/or 
therapeutic agents. A by no means limiting example of this is the use of cancer markers to produce 
antibodies specific to a particular marker. Yet another non-limiting example is the use of a peptide 
which complexes with an MHC molecule, to generate cytolytic T cells against abnormal cells. 

Preparation of such materials, of course, presupposes a source of the reagents used to 
generate these. Purification from cells is one laborious, far from sure method of doing so. Another 
preferred method is the isolation of nucleic acid molecules which encode a particular marker, 
followed by the use of the isolated encoding molecule to express the desired molecule. 

Two basic strategies have been employed for the detection of such antigens, in e.g., human 
tumors. These will be referred to as the genetic approach and the biochemical approach. The 
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genetic approach is exemplified by, e.g., dePlaen et al., Proc. Natl. Sci. USA 85: 2275 (1988), 
incorporated by reference. In this approach, several hundred pools of plasmids of a cDNA library 
obtained from a tumor are transfected into recipient cells, such as COS cells, or into antigen-negative 
variants of tumor cell lines which are tested for the expression of the specific antigen. The 
biochemical approach, exemplified by, e.g., O. Mandelboim, et al., Nature 369: 69 (1994) 
incorporated by reference, is based on acidic elution of peptides which have bound to MHC-class 
I molecules of tumor cells, followed by reversed-phase high performance liquid chromography 
(HPLC). Antigenic peptides are identified after they bind to empty MHC-class I molecules of 
mutant cell lines, defective in antigen processing, and induce specific reactions with cytotoxic T- 
lymphocytes. These reactions include induction of CTL proliferation, TNF release, and lysis of 
target cells, measurable in an MTT assay, or a 51 Cr release assay. 

These two approaches to the molecular definition of antigens have the following 
disadvantages: first, they are enormously cumbersome, time-consuming and expensive; and second, 
they depend on the establishment of cytotoxic T cell lines (CTLs) with predefined specificity. 

The problems inherent to the two known approaches for the identification and molecular 
definition of antigens is best demonstrated by the fact that both methods have, so far, succeeded in 
defining only very few new antigens in human tumors. See, e.g., van der Bruggen et al., Science 
254: 1643-1647 (1991); Brichard et al., J. Exp. Med. 178: 489-495 (1993); Coulie, et al, J. Exp. 
Med. 180: 35-42 (1994); Kawakami, et al., Proc. Natl. Acad. Sci. USA 91: 3515-3519 (1994). 

Further, the methodologies described rely on the availability of established, permanent cell 
lines of the cancer type under consideration. It is very difficult to establish cell lines from certain 
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cancer types, as is shown by, e.g., Oettgen, et al, Immunol. Allerg. Clin. North. Am. 10:607-637 
(1990). It is also known that some epithelial cell type cancers are poorly susceptible to CTLs in 
vitro, precluding routine analysis. These problems have stimulated the art to develop additional 
methodologies for identifying cancer associated antigens. 

One key methodology is described by Sahin, et al., Proc. Natl. Acad. Sci. USA 92: 11810- 
11913 (1995), incorporated by reference. Also, see U.S. Patent No. 5,698,396, and Application 
Serial No. 08/479,328, filed on June 7, 1995 and January 3, 1996, respectively. All three of these 
references are incorporated by reference. To summarize, the method involves the expression of 
cDNA libraries in a prokaryotic host. (The libraries are secured from a tumor sample). The 
expressed libraries are then immunoscreened with absorbed and diluted sera, in order to detect those 
antigens which elicit high titer humoral responses. This methodology is known as the SEREX 
method ("Serological identification of antigens by Recombinant Expression Cloning"). The 
methodology has been employed to confirm expression of previously identified tumor associated 
antigens, as well as to detect new ones. See the above referenced patent applications and Sahin, et 
al., supra , as well as Crew, et al., EMBO J 144: 2333-2340 (1995). 

This methodology has been applied to a range of tumor types, including those described by 
Sahin et al., supra, and Pfreundschuh, supra, as well as to esophageal cancer (Chen et al., Proc. Natl. 
Acad. Sci. USA 94: 1914-1918 (1997)); lung cancer (Gure et al., Cancer Res. 58: 1034-1041 
(1998)); colon cancer (Serial No. 08/948, 705 filed October 10, 1997) incorporated by reference, and 
so forth. Among the antigens identified via SEREX are the SSX2 molecule (Sahin et al., Proc. Natl. 
Acad. Sci. USA 92: 11810-11813 (1995); Tureci et al., Cancer Res. 56: 4766-4772 (1996); NY- 



656796.1 



LUD 5615.1-JEL/NDH (1006411) 

ESO-1 Chen, et al., Proc. Natl. Acad. Sci. USA 94: 1914-1918 (1997); and SCP1 (Serial No. 
08/892,705 filed July 15, 1997) incorporated by reference. Analysis ofSEREX identified antigens 
has shown overlap between SEREX defined and CTL defined antigens. MAGE-1, tyrosinase, and 
NY-ESO-1 have all been shown to be recognized by patient antibodies as well as CTLs, showing 
that humoral and cell mediated responses do act in concert. 

It is clear from this summary that identification of relevant antigens via SEREX is a desirable 
aim. The inventors have applied this methodology and have identified several new antigens 
associated with cancer, as detailed in the description which follows. 
DF.TATLED DESCRIPTION OF PREFERRED EMBODIMENTS 



EXAMPLE 1 

The SEREX methodology, as described by, e.g. Sahin, et al., Proc. Natl. Acad. Sci. USA 92: 
11810-11813 (1995); Chen, et al, Proc. Natl. Acad. Sci. USA 94: 1914-1918 (1997), and U.S. 
Patent No. 5,698,396, all of which are incorporated by reference. In brief, total RNA was extracted 
from a sample of a cutaneous metastasis of abreast cancer patient (referred to as "BR1 1" hereafter), 
using standard CsCl guanidine thiocyanate gradient methodologies. A cDNA library was then 
prepared, using commercially available kits designed for this purpose. Following the SEREX 
methodology referred to supra, this cDNA expression library was amplified, and screened with either 
autologous BR1 1 serum which had been diluted to 1 :200, or with allogeneic, pooled serum, obtained 
from 7 different breast cancer patients, which had been diluted to 1:1000. To carry out the screen, 
samples were first diluted to 1 : 1 0, and then preabsorbed with lysates of Kcoli that had been 



serumi 
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transfected with naked vector, and the serum samples were then diluted to the levels described supra. 
The final dilutions were incubated overnight at room temperature with nitrocellulose membranes 
containing phage plaques, at a density of 4-5000 plaque forming units ("pfus") per 130 mm plate. 

Nitrocellulose filters were washed, and incubated with alkaline phosphatase conjugated, goat 
anti-human Fey secondary antibodies, and reactive phage plaques were visualized via incubation 
with 5-bromo-4-chloro-3-indolyl phosphate and nitroblue tetrazolium. 

This procedure was also carried out on a normal testicular cDNA library, using a 1 :200 serum 
dilution. 

A total of 1.12xl0 6 pfus were screened in the breast cancer cDNA library, and 38 positive 
clones were identified. With respect to the testicular library, 4x1 0 5 pfus were screened, and 28 
positive clones were identified. 

Additionally, 8xl0 5 pfus from the BR1 1 cDNA library were screened using the pooled serum 

described. Of these, 23 were positive. 

The positive clones were subcloned, purified, and excised to forms suitable for insertion in 
plasmids. Following amplification of the plasmids, DNA inserts were evaluated via restriction 
mapping (EcoRI-Xbal), and clones which represented different cDNA inserts were sequenced using 
standard methodologies. 

If sequences were identical to sequences found in GenBank, they were classified as known 
genes, while sequences which shared identity only with ESTs, or were identical to nothing in these 
data bases, were designated as unknown genes. Of the clones from the breast cancer library which 
positive with autologous serum, 3 were unknown genes. Of the remaining 35, 15 were 



were 
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identical to either NY-ESO-1, or SSX2, two known members of the CT antigen family described 
supra, while the remaining clones corresponded to 14 known genes. Of the testicular library, 12 of 

the clones were SSX2. 

The NY-ESO-1 antigen was not found, probably because the commercial library that was 
used had been size fractionated to have an average length of 1 .5 kilobases, which is larger than full 
length NY-ESO-1 cDNA which is about 750 base pairs long. 

With respect to the screening carried out with pooled, allogeneic sera, four of the clones were 
NY-ESO-1. No other CT antigens were identified. With the exception of NY-ESO-1, all of the 
genes identified were expressed universally in normal tissue. 

A full listing of the isolated genes, and their frequency of occurrence follows, in tables 1, 2 
and 3. Two genes were found in both the BR11 and testicular libraries, i.e., poly (ADP-ribose) 
polymerase, and tumor suppression gene ING1 . The poly (ADP-ribose) polymerase gene has also 
been found in colon cancer libraries screened via SEREX, as is disclosed by Scanlan, et al., Int. J. 
Cancer 76: 652-58 (1998) when the genes identified in the screening of the BR1 1 cDNA library by 
autologous and allogeneic sera were compared, NY-ESO-1 and human keratin. 

Table 1. SEREX-defined genes identified by autologous screening of BR11 cDNA library 

Gene group No. of clones Comments Expression 



CT genes 



10 



NY-ESO-1 



tumor, testis 



5 



SSX2 



tumor, testis 



Non-CT genes 



5 



Nuclear Receptor Co-Repressor 



ubiquitous 



4 



Poly( ADP-ribose) polymerase 
Adenylosuccinatelyase 



ubiquitous 



2 



ubiquitous 
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1 


NAD(+) ADP-Ribosyltransferase 


ubiquitous 


1 


Human keratin 10 


ESTs: multiple normal tissues 


1 


Human EGFR kinase substrate 


ubiquitous 




ING 1 Tumor suppressor gene 


RT-PCR: multiple normal tissues 




Unknown gene, NCI__CGAP_Prl2 
cDNA clone 


ESTs: pancreas, liver, spleen, uterus 




Unknown gene 


ESTs: multiple normal tissues 




Unknown gene 


RT-PCR: multiple normal tissues 


Table 2. SEREX-defined genes identified by allogeneic screening of BR11 cDNA library 


Gene group No. of clones 


Comments 


Expression 


CT genes 4 


NY-ESO-1 


tumor, testis 


Non-CT genes 6 


zinc-finger helicase 


ESTs: brain, fetal heart, total fetus 


4 


Acetoacetyl-CoA-thiolase 


ubiquitous 


3 


KIAA0330 gene 


ESTs: multiple normal tissues 


2 


UlsnRNP 


ubiquitous 


1 


Human aldolase A 


ubiquitous 


1 


Retinoblastoma binding protein 6 


ESTs: tonsils, fetal brain, endothelial 
cells, brain 


1 


a2-Macroglobulin receptor 
associated protein 


ubiquitous 


1 


Human Keratin 10 


ESTs: multiple normal tissues 


Table 3. SEREX-defined genes identified by screening of a testicular cDNA library with BR11 serum 


Gene group No. of clones 


Comments 


Expression 



8 
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CT genes: 



12 



SSX2 



tumor, testis 



Non-CT genes: 



3 Rho-associated coiled-coil forming 

protein 

3 Poly(ADP-ribose) polymerase 

3 Gene from HeLa cell, similar to 

TITIN 

2 Gene from parathyroid tumor 



Transcription termination factor 
I-interacting peptide 21 

Gene from fetal heart 

ING 1 tumor suppressor gene 

KIAA0647 cDNA 

KIAA0667 cDNA 



ubiquitous 

ubiquitous 
ubiquitous 

RT-PCR: multiple normal tissues 
ubiquitous 

ESTs: multiple normal tissues 
RT-PCR: multiple normal tissues 
ESTs: multiple normal tissues 
ESTs: multiple normal tissues 



EXAMPLE 2 

The mRNA expression pattern of the cDNAs identified in example 1, in both normal and 
malignant tissues, was studied. To do this, gene specific oligonucleotide primers were designed 
which would amplify cDNA segments 300-600 base pairs in length, using a primer melting 
temperature of 65-70° C. The primers used for amplifying MAGE- 1,2,3 and 4, B AGE, NY-ESO- 1 , 
SCP1, and SSX1, 2, 3, 4 and 5 were known primers, or were based on published sequences. See 
Chen, et al. suEia; Tureci, et al., Proc. Natl. Acad. Sci. USA 95: 5211-16 (1998). Gure, et al., Int. 
J. Cancer 72: 965-71 (1997); Chen, et al., Proc. Natl. Acad. Sci. USA 91: 1004-1008 (1994); 
Gaugler, et al., J. Exp. Med. 179: 921-930 (1994), dePlaen, et al, Immunogenetics 40: 360-369 
(1994), all of which are incorporated by reference. RT-PCR was carried out for 35 amplification 
cycles, at an annealing temperature of 60° C. Using this RT-PCR assay, the breast cancer tumor 
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specimen was positive for a broad range of CT antigens, including MAGE- 1,3 AND 4, BAGE, 
SSX2, NY-ESO-1 and CT7. The known CT antigens SCP-1, SSX1, 4 and 5 were not found to be 
expressed. 

An additional set of experiments were carried out, in which the seroreactivity of patient sera 
against tumor antigens was tested. Specially, ELIS As were carried out, in accordance with Stockert, 
et al., J. Exp. Med. 187: 1349-1354 (1998), incorporated by reference, to determine if antibodies 
were present in the patient sera. Assays were run for MAGE-1, MAGE-3, NY-ESO-1, and SSX2. 
The ELISAs were positive for NY-ESO-1 and SSX2, but not the two MAGE antigens. 

EXAMPLE 3 

Two clones (one from the breast cancer cDNA library and one from the testicular library), 
were identified as a gene referred to as ING1, which is a tumor suppressor gene candidate. See 
Garkavtsev, et al, Nature 391 : 295-8 (1998), incorporated by reference. The sequence found in the 
breast cancer library, differed from the known sequence of ING1 at six residues, i.e., positions 818, 
836, 855, 861, 866 and 874. The sequence with the six variants is set forth at SEQ ID NO: 1 . The 
sequence of wild type ING1 is set out at SEQ ID NO: 2. 

To determine if any of these differences represented a mutation in tumors, a short, PCR 
fragment which contained the six positions referred to supra was amplified from a panel of 
allogeneic normal tissue, subcloned, amplified, and sequenced following standard methods. 

The results indicated that the sequences in the allogeneic tissues were identical to what was 
found in tumors, ruling out the hypothesis that the sequence differences were a tumor associated 
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mutation. This conclusion was confirmed, using the testicular library clone, and using restriction 
analysis of ING1 cDNA taken from normal tissues. One must conclude, therefore, that the sequence 
information provided by Garkavtsev, et al., supra , is correct. 

EXAMPLE 4 

Additional experiments were carried out to determine whether genetic variations might exist 
in the 5' portion of the ING1 gene, which might differ from the 5' portion of the clone discussed 
supra (SEQ ID NO: 1). In a first group of experiments, attempts were made to obtain full length 
ING1 cDNA from both the breast tumor library, and the testicular library. SEQ ID NO: 1 was used 
as a probe of the library, using standard methods. 

Four clones were isolated from the testicular library and none were isolated from the breast 
cancer library. The four clones, following sequencing, were found to derive from three transcript 
variants. The three variants were identical from position 586 down to their 3' end, but differed in 
their 5 ' regions, suggesting alternatively spliced variants, involving the same exon-intron junction. 
All three differed from the sequence of ING1 described by Garkavtsev, et al., in Nat. Genet, 14:415- 
420 (1996). These three variants are set out as SEQ ID NOS: 1, 3 and 4. 

All of the sequences were then analyzed. The ORFs of SEQ ID NOS: 2, 1 and 4 (SEQ ID 
NO: 2 is the originally disclosed , ING1 sequence), encode polypeptides of 294, 279 and 235 amino 
acids, of which 233 are encoded by the 3 ' region common to the three sequences. These putative 
sequences are set out as SEQ ID NOS: 19, 5, and 7. With respect to SEQ ID NO: 3, however, no 
translational initiation site could be identified in its 5' region. 
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EXAMPLE 5 

The data regarding SEQ ID NO: 3, described supra, suggested further experiments to find 
additional ORFs in the 5-end of variant transcripts of the molecule. In order to determine this, 5'- 
RACE -PCR was carried out using gene specific and adapted specific primers, together with 
commercially available products, and standard methodologies. 

The primers used for these experiments were: 

CACACAGGATCCATGTTGAGTCCTGCCAACGG 

CGTGGTCGTGGTTGCTGGACGCG 

(SEQ ID NOS: 9 and 10), for SEQ ID NO: 1; 

CCCAGCGGCCCTGACGCTGTC 
CGTGGTCGTGGTTGCTGGACGCG 

(SEQ ID NOS: 1 1 and 12), for SEQ ID NO: 3; and 

GGAAGAGATAAGGCCTAGGGAAG 
CGTGGTCGTGGTTGCTGGACGCG 

(SEQ ID NOS: 13 and 14), for SEQ ID NO: 4. 

Cloning and sequencing of the products of RACE PCR showed that the variant sequence of 
SEQ ID NO: 4 was 5' to SEQ ID NO: 3, and that full length cDNA for the variant SEQ ID NO: 3 
contained an additional exon 609 nucleotides long, positioned between SEQ ID NO: 3 and the 
shared, 3' sequence referred to supra . This exon did not include an ORF. The first available 
initiation site would be an initial methionine at amino acid 70 of SEQ ID NO: 1 . Thus, if expressed, 
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SEQ ID NO: 3 would correspond to a molecule with a 681 base pair, untranslated 5' end and a 
region encoding 210 amino acids (SEQ ID NO:6). 



EXAMPLE 6 

The presence of transcript variants with at least 3 different trancriptional initiation sites, and 
possibly different promoters, suggested that mRNA expression might be under different, tissue 
specific regulation. 

To determine this, variant-specific primers were synthesized, and RT-PCR was carried out 
on a panel of tissues, using standard methods. 

SEQ ID NO: 1 was found to be expressed universally in all of the normal breast, brain and 
testis tissues examined, in six breast cancer lines, and 8 melanoma cell lines, and in cultured 
melanocytes. SEQ ID NO: 3 was found to be expressed in four of the six breast cancer lines, normal 
testis, liver, kidney, colon and brain. SEQ ID NO: 4 was only found to be expressed by normal testis 
cells and weakly in brain cells. 



EXAMPLE 7 

A further set of experiments were carried out to determine if antibodies against ING1 were 
present in sera of normal and cancer patients. A phase plaque immuno assay of the type described 
supra was carried out, using clones of SEQ ID NO: 1 as target. Of 14 allogeneic sera taken from 
breast cancer patients, two were positive at 1 :200 dilutions. All normal sera were negative. 
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EXAMPLE 8 

The BR11 cDNA library described supra was then screened, using SEQ ID NO: 1 and 
standard methodologies. A 593 base pair cDNA was identified, which was different from any 
sequences in the data banks consulted. The sequence of this cDNA molecule is set out at SEQ ID 
NO: 8. 

The cDNA molecule set forth as SEQ ID NO:l was then used in Southern blotting 
experiments. In brief, genomic DNA was isolated from normal human tissue, digested with BamHI 
or Hind III, and then separated onto 0.7% agarose gel, blotted onto nitrocellulose filters, and 
hybridized using 32 P labelled SEQ ID NO: 1, at high stringency conditions (aqueous buffer, 65 °C). 
The probes were permitted to hybridize overnight, and then exposed for autoradiography. Two 
hybridizing DNA species were identified, i.e., SEQ ID NOS: 1 and 8. 



EXAMPLE 9 

The cDNA molecule set forth in SEQ ID NO: 8 was then analyzed. 5'- RACE PCR was 
carried out using normal fetus cDNA. Full length cDNA for the molecule is 771 base pairs long, 
without the poly A tail. It shows strong homology to SEQ ID NO: 1, with the strongest homology 
in the 5 ' two-thirds (76% identity over nucleotide 1-480); however, the longest ORF is only 129 base 
pairs, and would encode a poly peptide 42 amino acids long which was homologous to, but much 
shorter than, the expected expression product of SEQ ID NO: 1. 

In addition to the coding region, SEQ ID NO: 8 contains 203 base pairs of 5 '-untranslated 
region, and 439 base pairs of 3 '-untranslated region. 

14 

656796.1 



LUD 5615.1-JEL/NDH (1006411) 

RT-PCR assays were carried out, as described supra . All of the normal tissues tested, 
including brain, colon, testis, tissue and breast, were positive for expression of this gene. Eight 
melanoma cell lines were tested, of which seven showed varying levels of expression, and one 
showed no expression. Six breast cancer cell lines were tested, of which four showed various levels 
of expression, and two showed no expression. 



EXAMPLE 10 

An additional breast cancer cDNA library, referred to as "BR17-128", was screened, using 
autologous sera. A cDNA molecule was identified. 

Analysis of the sequence suggested that it was incomplete at the 5' end. To extend the 
sequence, a testicular cDNA library was screened with a nucleotide probe based upon the partial 
sequence identified in the breast cancer library. An additional 1200 base pairs were identified 
following these screenings. The 201 1 base pairs of information are set forth in SEQ ID NO: 15. 

The longest open reading frame is 1539 base pairs, corresponding to aprotein of about 59. 15 
kilodaltons. The deduced sequence is set forth at SEQ ED NO: 16. 

RT-PCR was then carried out using the following primers: 

CACACAGGATCCATGCAGGCCCCGCACAAGGAG 
CACACAAAGCTTCTAGGATTTGGCACAGCCAGAG 

(SEQIDNOS: 17 and 18) 

Strong signals were observed in normal testis and breast tissue, and weak expression was 

observed in placenta. 
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No expression was found in normal brain, kidney, liver, colon, adrenal, fetal brain, lung, 
pancreas, prostate, thymus, uterus, and ovary tissue of tumor cell lines tested, 2 of the breast cancer 
lines were strongly positive and two were weakly positive. Of melanoma two of 8 were strongly 
positive, and 3 were weakly positive. Of lung cancer cell lines, 4 of 15 were strongly positive, and 

3 were weakly positive. 

When cancer tissue specimens were tested, 16 of 25 breast cancer samples were strongly 
positive, and 3 additional samples were weakly positive. Two of 36 melanoma samples were 
positive (one strong, one weak). All other cancer tissue samples were negative. 

When Northern blotting was carried out, a high molecular weight smear was observed in 
testis, but in no other tissues tested. 



EXAMPLE 11 

Further experiments were carried out using the tumor sample referred to in example 10, 
SU p ra . This sample was derived from a subcutaneous metastasis of a 60 year old female breast 
cancer patient. Total RNA was extracted, as described supra . Following the extraction, a cDNA 
library was constructed in X-ZAP expression vectors, also as described supra. Screening was carried 
out, using the protocol set forth in example 1. A total of 7 x 10 5 pfus were screened. Fourteen 
reactive clones were identified, purified, and sequenced. The sequences were then compared to 
published sequences in GenBank and EST databases. These analyses indicated that the clones were 
derived from seven distinct genes, two of which were known, and five unknown. The two known 
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genes were "PBK- 1 " (three clones), and TI-227 (one clone). These are universally expressed genes, 
with the libraries referred to supra showing ESTs for these genes from many different tissues. 

With respect to the remaining 10 clones, six were derived from the same gene, referred to 
hereafter as "NY-BR-1." Three cDNA sequences were found in the EST database which shared 
identity with the gene. Two of these (AI 951 1 18 and AW 373574) were identified as being derived 
from a breast cancer library, while the third (AW 170035), was from a pooled tissue source. 



EXAMPLE 12 

The distribution of the new gene NY-BR-1 referred to supra was determined via RT-PCR. 
In brief, gene specific oligonucleotide NY-BR-1 primers were designed to amplify cDNA segments 
300-600 base pairs in length, with primer melting temperatures estimated at 65-70 °C. 
The RT-PCR was then carried out over 30 amplification cycles, using a thermal cycler, and an 
annealing temperature of 60°C. Products were analyzed via 1 .5% gel electrophoresis, and ethidium 
bromide visualization. Fifteen normal tissues (adrenal gland, fetal brain, lung, mammary gland, 
pancreas, placenta, prostate, thymus, uterus, ovary, brain, kidney, liver, colon and testis) were 
assayed. The NY-BR-1 clone gave a strong signal in mammary gland and testis tissue, and a very 
faint signal in placenta. All other tissues were negative. The other clones were expressed 
universally, based upon comparison to information in the EST database library, and were not 
pursued further. 

The expression pattern of NY-BR-1 in cancer samples was then tested, by carrying out RT- 
PCR, as described supra, on tumor samples. 
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In order to determine the expression pattern, primers: 
caaagcagag cctcccgaga ag 

(SEQ ID NO: 20) and 

cctatgctgc tcttcgattc ttcc 

(SEQ ID NO: 21) were used. 

Of twenty-five breast cancer samples tested, twenty two were positive for NY-BR-L Of these, 
seventeen gave strong signals, and five gave weak to modest signals. 

An additional 82 non-mammary tumor samples were also analyzed, divided into 36 
melanoma, 26 non small cell lung cancer, 6 colon cancer, 6 squamous cell carcinoma, 6 transitional 
cell carcinoma, and two leiyomyosarcomas. Only two melanoma samples were positive for NY-BR- 
1 expression. 

The study was then extended to expression of NY-BR-1 in tissue culture. Cell lines derived 
from breast tumor, melanoma, and small cell lung cancer were studied. Four of six breast cancer 
cells were positive (two were very weak), four of eight melanoma (two very weak), and seven of 
fourteen small cell lung cancer lines (two very weak) were positive. 

EXAMPLE 13 

In order to determine the complete cDNA molecule for NY-BR-1 , the sequences of the six 
clones referred to supra were compiled, to produce a nucleotide sequence 1464 base pairs long. 
Analysis of the open reading frame showed a continuous ORF throughout, indicating that the 
compiled sequence is not complete. 
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Comparison of the compiled sequence with the three EST library sequences referred to supra 
allowed for extension of the sequence. The EST entry AW170035 (446 base pairs long) overlapped 
the compiled sequence by 89 base pairs at its 5' end, permitting extension of the sequence by another 
357 base pairs. A translational terminal codon was identified in this way, leading to a molecule with 
a 3'-untranslated region 333 base pairs long. The 5' end of the molecule was lacking, however, 
which led to the experiment described infra . 

EXAMPLE 14 

In order to determine the missing, 5' end of the clone described supra, a commercially 
available testis cDNA expression library was screened, using a PCR expression product of the type 
described supra as aprobe. In brief, 5xl0 4 pfusper 150 mm plate were transferred to nitrocellulose 
membranes, which were then submerged in denaturation solution (1.5M NaCl and 0.5 M NaOH), 
transferred to neutralization solution (1 .5 M NaCl and 0.5M Tris-HCl), and then rinsed with 0.2M 
Tris-HCl, and 2xSSC. Probes were labelled with 32 P and hybridization was carried out at high 
stringency conditions (i.e., 68°C, aqueous buffer). Any positive clones were subcloned, purified, 
and in vivo excised to plasmid PBK-CMV, as described supra. 

One of the clones identified in this way included an additional 1346 base pairs at the 5' end; 
however, it was not a full length molecule. A 5*-RACE-PCR was carried out, using commercially 
available products. The PCR product was cloned into plasmid vector pGEMT and sequenced. The 
results indicated that cDNA sequence was extended 1292 base pairs further, but no translation 
initiation site could be determined, because no stop codons could be detected. It could be concluded, 
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however, that the cDNA of the NY-BR17 clone comprises at least 4026 nucleotides, which are 
presented as SEQ ID NO: 22. The molecule, as depicted, encodes a protein at least about 1 52.8 kDA 
in molecular weight. Structurally, there are 99basepairs 5' to the presumed translation initiation site, 
and an untranslated segment 333 base pairs long at the 3' end. The predicted amino acid sequence 
of the coding region for SEQ ID NO: 22 is set out at SEQ. ID NO: 23. 

SEQ ID NO: 23 was analyzed for motifs, using the known search programs PROSITE and 
Pfam. A bipartite nuclear localization signal motif was identified at amino acids 17-34, suggesting 
that the protein is anuclear protein. Five tandem ankyrin repeats were identified, at amino acids 49- 
81, 82-114, 115-147, 148-180 and 181-213. AbZIP site (i.e. a DNA binding site followed by a 
leucine zipper motif) was found at amino acid positions 1077- 1 1 04, suggesting a transcription factor 
function. It was also observed that three repetitive elements were identified in between the ankyrin 
repeats and the bZIP DNA binding site. To elaborate, a repetitive element 117 nucleotides long is 
trandemly repeated 3 times, between amino acids 459-815. The second repetitive sequence, 
consisting of 1 1 amino acids, repeats 7 times between amino acids 224 and 300. The third repetitive 
element, 34 amino acids long, is repeated twice, between amino acids 301-368. 



EXAMPLE 15 

The six clones described supra were compared, and analysis revealed that they were derived 
from two different splice variants. Specifically, two clones, referred to as "BR17-8" and "BR 17- 
44a", contain one more exon, of 1 1 1 base pairs (nucleotides 301 5-3 125 of SEQ ID NO: 22), which 
encodes amino acids 973-1009 of SEQ ID NO: 23, than do clones BR 17-la, BR17-35b and BR17- 
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44b. The shortest of the six clones, BR17-128, starts 3' to the additional exons. The key structural 
elements referred to supra were present in both splice variants, suggesting that there was no 
difference in biological function. 

The expression pattern of the two splice variants was assessed via PT-PCR, using primers 
which spanned the 1 1 1 base pair exon referred to supra . 
The primers used were: 

aatgggaaca agagctctgc ag 

(SEQ ID NO: 24) and 

gggtcatctg aagttcagca ttc 

(SEQ ID NO: 25) 

Both variants were expressed strongly in normal testis and breast. The longer variant was dominant 
in testis, and the shorter variant in breast cells. When breast cancer cells were tested, co-typing of 
the variant was observed, (7 strongly, 2 weakly positive, and 1 negative), with the shorter variant 
being the predominant form consistently. 
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EXAMPLE 16 

The frequency of antibody response against NY-BR-1 in breast cancer patients was tested. 
To do this, a recombinant protein consisting of amino acids 993-1188 of SEQ ID NO: 23 was 
prepared. (This is the protein encoded by clone BR 17-128, referred to supra). A total of 140 serum 
samples were taken from breast cancer patients, as were 60 normal serum samples. These were 
analyzed via Western blotting, using standard methods. 

Four of the cancer sera samples were positive, including a sample from patient BR17. All 

normal sera were negative. 

An additional set of experiments was then carried out to determine if sera recognized the 
portion of NY-BR-1 protein with repetitive elements. To do this, a different recombinant protein, 
consisting of amino acids 405-1000 was made, and tested in Western blot assays. None of the four 
antibody positive sera reacted with this protein indicating that an antibody epitope is located in the 
non-repetitive, carboxy terminal end of the molecule. 
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EXAMPLE 17 

The screening of the testicular cDNA library referred to supra resulted, inter alia, in the 
identification of a cDNA molecule that was homologous to NY-BR-1 . The molecule is 3673 base 
pairs in length, excluding the poly A tail. This corresponded to nucleotides 1-3481 of SEQ ID NO: 
22, and showed 62% homology thereto. No sequence identity to sequences in libraries was noted. 
ORF analysis identified an ORF from nucleotide 641 through the end of the sequence, with 54% 
homology to the protein sequence of SEQ. ID NO: 23. The ATG initiation codon of this sequence 
is 292 base pairs further 3' to the presumed initiation codon of NY-BR-1, and is preceded by 640 
untranslated base pairs at its 5' end. This 640 base pair sequence includes scattered stop codons. 
The nucleotide sequence and deduced amino acid sequence are presented as SEQ ID NOS: 26 and 
27, respectively. 

RT-PCR analysis was carried out in the same way as is described supra, using primers: 
tct catagat gctggtgctg ate 

(SEQ ID NO: 28) and 

cccagacatt gaattttggc agac 

(SEQ ID NO: 29). 

Tissue restricted mRNA expression was found. The expression pattern differed from that of SEQ 
ID NO: 22. In brief, of six normal tissues examined, strong signals were found in brain and testis 
only. There was no or weak expression in normal breast tissues, and kidney, liver and colon tissues 
: negative. Eight often 10 breast cancer specimens tested supra were positive for SEQ. ID NO: 



were i 
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26. Six samples were positive for both SEQ. ID NO: 22 and 26, one for SEQ. ID NO: 22 only, two 
for the SEQ. ID NO: 26 only, and one was negative for both. 

The foregoing examples describe the isolation of a nucleic acid molecule which encodes a 
cancer associated antigen. "Associated" is used herein because while it is clear that the relevant 
molecule was expressed by several types of cancer, other cancers, not screened herein, may also 
express the antigen. 

The invention relates to nucleic acid molecules which encode the antigens encoded by, e.g., 
SEQ ID NOS: 1, 3, 8, 15, 22 and 26 as well as the antigens encoded thereby, such as the proteins 
with the amino acid sequences of SEQ ID NOS: 5, 6, 7, 16, 23 and 27. It is to be understood that 
all sequences which encode the recited antigen are a part of the invention. 

Also a part of the invention are expression vectors which incorporate the nucleic acid 
molecules of the invention, in operable linkage (i.e., "operably linked") to apromoter. Construction 
of such vectors, such as viral (e.g., adenovirus or Vaccinia virus) or attenuated viral vectors is well 
within the skill of the art, as is the transformation or transfection of cells, to produce eukaryotic cell 
lines, or prokaryotic cell strains which encode the molecule of interest. Exemplary of the host cells 
which can be employed in this fashion are COS cells, CHO cells, yeast cells, insect cells (e.g., 
Spodo ptera frueiperda) , NIH 3T3 cells, and so forth. Prokaryotic cells, such as E. coli and other 
bacteria may also be used. Any of these cells can also be transformed or transfected with further 
nucleic acid molecules, such as those encoding cytokines, e.g., interleukins such as IL-2, 4, 6, or 12 
or HLA or MHC molecules. 
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Also a part of the invention are the antigens described herein, both in original form and in 
any different post translational modified forms. The molecules are large enough to be antigenic 
without any posttranslational modification, and hence are useful as immunogens, when combined 
with an adjuvant (or without it), in both precursor and post-translationally modified forms. 
Antibodies produced using these antigens, both poly and monoclonal, are also a part of the invention 
as well as hybridomas which make monoclonal antibodies to the antigens. The whole protein can 
be used therapeutically, or in portions, as discussed infra. Also apart of the invention are antibodies 
against this antigen, be these polyclonal, monoclonal, reactive fragments, such as Fab, (F(ab) 2 ' and 
other fragments, as well as chimeras, humanized antibodies, recombinantly produced antibodies, and 
so forth. 

As is clear from the disclosure, one may use the proteins and nucleic acid molecules of the 
invention diagnostically. The SEREX methodology discussed herein is premised on an immune 
response to a pathology associated antigen. Hence, one may assay for the relevant pathology via, 
e.g., testing a body fluid sample of a subject, such as serum, for reactivity with the antigen per se. 
Reactivity would be deemed indicative of possible presence of the pathology. So, too, could one 
assay for the expression of any of the antigens via any of the standard nucleic acid hybridization 
assays which are well known to the art, and need not be elaborated upon herein. One could assay for 
antibodies against the subject molecules, using standard immunoassays as well. 

Analysis of SEQ ID NO: 1, 3, 4, 8, 15, 22 and 26 will show that there are 5' and 3' non- 
coding regions presented therein. The invention relates to those isolated nucleic acid molecules 
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which contain at least the coding segment, and which may contain any or all of the non-coding 5' 
and 3' portions. 

Also a part of the invention are portions of the relevant nucleic acid molecules which can be 
used, for example, as oligonucleotide primers and/or probes, such as one or more of SEQ ID NOS: 
9, 10, 1 1, 12, 13, 14, 17, 18,20, 21, 24, 25, 28, and 29 as well as amplification products like nucleic 
acid molecules comprising at least nucleotides 305-748 of SEQ ID NO: 1 , or amplification products 
described in the examples, including those in examples 12, 14, etc. 

As was discussed supra, study of other members of the "CT" family reveals that these are 
also processed to peptides which provoke lysis by cytolytic T cells. There has been a great deal of 
work on motifs for various MHC or HLA molecules, which is applicable here. Hence, a further 
aspect of the invention is a therapeutic method, wherein one or more peptides derived from the 
antigens of the invention which bind to an HLA molecule on the surface of a patient's tumor cells 
are administered to the patient, in an amount sufficient for the peptides to bind to the MHC/HLA 
molecules, and provoke lysis by T cells. Any combination of peptides may be used. These peptides, 
which may be used alone or in combination, as well as the entire protein or immunoreactive portions 
thereof, may be administered to a subject in need thereof, using any of the standard types of 
administration, such as intravenous, intradermal, subcutaneous, oral, rectal, and transdermal 
administration. Standard pharmaceutical carriers, adjuvants, such as saponins, GM-CSF, and 
interleukins and so forth may also be used. Further, these peptides and proteins may be formulated 
into vaccines with the listed material, as may dendritic cells, or other cells which present relevant 
MHC/peptide complexes. 
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Similarly, the invention contemplates therapies wherein nucleic acid molecules which encode 
the proteins of the invention, one or more or peptides which are derived from these proteins are 
incorporated into a vector, such as a Vaccinia or adenovirus based vector, to render it transferable 
into eukaryotic cells, such as human cells. Similarly, nucleic acid molecules which encode one or 
more of the peptides may be incorporated into these vectors, which are then the major constituent 
of nucleic acid bases therapies. 

Any of these assays can also be used in progression/regression studies. One can monitor the 
course of abnormality involving expression of these antigens simply by monitoring levels of the 
protein, its expression, antibodies against it and so forth using any or all of the methods set forth 
supra . 

It should be clear that these methodologies may also be used to track the efficacy of a 
therapeutic regime. Essentially, one can take a baseline value for a protein of interest using any of 
the assays discussed supra, administer a given therapeutic agent, and then monitor levels of the 
protein thereafter, observing changes in antigen levels as indicia of the efficacy of the regime. 

As was indicated supra , the invention involves, inter alia , the recognition of an "integrated" 
immune response to the molecules of the invention. One ramification of this is the ability to monitor 
the course of cancer therapy. In this method, which is a part of the invention, a subject in need of the 
therapy receives a vaccination of a type described herein. Such a vaccination results, e.g., in a T cell 
response against cells presenting HLA/peptide complexes on their cells. The response also includes 
an antibody response, possibly a result of the release of antibody provoking proteins via the lysis of 
cells by the T cells. Hence, one can monitor the effect of a vaccine, by monitoring an antibody 
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response. As is indicated, supra, an increase in antibody titer may be taken as an indicia of progress 
with a vaccine, and vice versa. Hence, a further aspect of the invention is a method for monitoring 
efficacy of a vaccine, following administration thereof, by deteirnining levels of antibodies in the 
subject which are specific for the vaccine itself, or a large molecule of which the vaccine is a part. 

The identification of the subj ect proteins as being implicated in pathological conditions such 
as cancer also suggests a number of therapeutic approaches in addition to those discussed supra. The 
experiments set forth supra establish that antibodies are produced in response to expression of the 
protein. Hence, a further embodiment of the invention is the treatment of conditions which are 
characterized by aberrant or abnormal levels of one or more of the proteins, via administration of 
antibodies, such as humanized antibodies, antibody fragments, and so forth. These may be tagged 
or labelled with appropriate cystostatic or cytotoxic reagents. 

T cells may also be administered. It is to be noted that the T cells may be elicited in vitro 
using immune responsive cells such as dendritic cells, lymphocytes, or any other immune responsive 
cells, and then reperfused into the subject being treated. 

Note that the generation of T cells and/or antibodies can also be accomplished by 
administering cells, preferably treated to be rendered non-proliferative, which present relevant T cell 
or B cell epitopes for response, such as the epitopes discussed supra. 

The therapeutic approaches may also include antisense therapies, wherein an antisense 
molecule, preferably from 1 0 to 1 00 nucleotides in length, is administered to the subj ect either "neat" 
or in a carrier, such as a liposome, to facilitate incorporation into a cell, followed by inhibition of 
expression of the protein. Such antisense sequences may also be incorporated into appropriate 
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vaccines, such as in viral vectors (e.g., Vaccinia), bacterial constructs, such as variants of the known 
BCG vaccine, and so forth. 

Other features and applications of the invention will be clear to the skilled artisan, and need 
not be set forth herein. The terms and expression which have been employed are used as terms of 
description and not of limitation, and there is no intention in the use of such terms and expression 
of excluding any equivalents of the features shown and described or portions thereof, it being 
recognized that various modifications are possible within the scope of the invention. 
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We claim : 

1. Isolated nucleic acid molecule which encodes a cancer associated antigen, whose 
amino acid sequence is identical to the amino acid sequence encoded by the nucleotide sequence of 
SEQ ID NO: 1,3,4, 8, 15 or 19. 

2. The isolated nucleic acid molecule of claim 1 , comprising the nucleotide sequence 

of SEQ ID NO: 1. 

3 . The isolated nucleic acid molecule of claim 1 , comprising the nucleotide sequence 
ofSEQIDNO:3. 

4. The isolated nucleic acid molecule of claim 1 , comprising the nucleotide sequence 
of SEQ ID NO: 4. 

5. The isolated nucleic acid molecule of claim 1 , comprising the nucleotide sequence 
of SEQ ID NO: 8. 

6. The isolated nucleic acid molecule of claim 1 , comprising the nucleotide sequence 
of SEQ ID NO: 15. 

7. The isolated nucleic acid molecules of claim 1 , comprising the nucleotide sequence 
of SEQ ID NO: 19. 

8. Expression vector comprising the isolated nucleic acid molecule of claim 1 , operably 
linked to a promoter. 

9. Eukaryotic cell line or prokaryotic cell strain, transformed or transfected with the 
expression vector of claim 8. 
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1 0 . Isolated cancer associated antigen comprising all or part of the amino acid sequence 
encoded by SEQ ID NO: 1, 3, 4, 8, 15 or 19. 

11. Eukaryotic cell line or prokaryotic cell strain, transformed or transfected with the 
isolated nucleic acid molecule of claim 1. 

12. The eukaryotic cell line or prokaryotic cell strain of claim 1 1 , wherein said cell line 
is also transfected with a nucleic acid molecule coding for a cytokine. 

1 3 . The eukaryotic cell line or prokaryotic cell strain of claim 1 2, wherein said cell line 
is further transfected by a nucleic acid molecule coding for an MHC molecule. 

14. The eukaryotic cell line or prokaryotic cell strain of claim 12, wherein said cytokine 
is an interleukin. 

1 5 . The eukaryotic cell line or prokaryotic cell strain of claim 1 4, wherein said interleukin 
is IL-2, IL-4 or IL-12. 

1 6. The eukaryotic cell line or prokaryotic cell strain of claim 1 1 , wherein said cell line 
has been rendered non-proliferative. 

17. The eukaryotic cell line of claim 1 1, wherein said cell line is a fibroblast cell line. 

1 8 . Expression vector comprising a mutated or attenuated virus and the isolated nucleic 

acid molecule of claim 1. 

19. The expression vector of claim 1 8, wherein said virus is adenovirus or vaccinia virus. 

20. The expression vector of claim 19, wherein said virus is vaccinia virus. 

2 1 . The expression vector of claim 1 9, wherein said virus is adenovirus. 
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22. Expression system useful in transfecting a cell, comprising (i) a first vector containing 
a nucleic acid molecule which codes for the isolated cancer associated antigen of claim 1 1 and (ii) 
a second vector selected from the group consisting of (a) a vector containing a nucleic acid molecule 
which codes for an MHC or HLA molecule which presents an antigen derived from said cancer 
associated antigen and (b) a vector containing a nucleic acid molecule which codes for an 
interleukin. 

23. Immunogenic composition comprising the isolated cancer antigen of claim 10, and 
a pharmaceutical^ acceptable adjuvant. 

24. The immunogenic composition of claim 23, wherein said adjuvant is a cytokine, a 
saponin, or GM-CSF. 

25. Immunogenic composition comprising at least one peptide consisting of an amino 
acid sequence of from 8 to 12 amino acids concatenated to each other in the isolated cancer 
associated cancer antigen of claim 11, and a pharmaceutically acceptable adjuvant. 

26. The immunogenic composition of claim 25, wherein said adjuvant is a saponin, a 
cytokine, or GM-CSF. 

27. The immunogenic composition of claim 24, wherein said composition comprises a 
plurality of peptides which complex with a specific MHC molecule. 

28. Immunogenic composition which comprises at least one expression vector which 
encodes a peptide derived from the amino acid sequence encoded by SEQ ID NO: 1, 3, 4, 8, 15 or 
19. 
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29. The immunogenic composition of claim 28, wherein said at least one expression 
vector codes for a plurality of peptides. 

30. Vaccine useful in treating a subject afflicted with a cancerous condition comprising 
the isolated eukaryotic cell line of claim 1 1 and a pharmacologically acceptable adjuvant. 

3 1 . The vaccine of claim 30, wherein said eukaryotic cell line has been rendered non- 
proliferative. 

32. The vaccine of claim 3 1 , wherein said eukaryotic cell line is a human cell line. 

33. A composition of matter useful in treating a cancerous condition comprising a non- 
proliferative cell line having expressed on its surface a peptide derived from the amino acid sequence 
encoded by SEQ ID NO: 1, 3, 4, 8, 15 or 19. 

34. The composition of matter of claim 33, wherein said cell line is a human cell line. 

35. A composition of matter useful in treating a cancerous condition, comprising (i) a 
peptide derived from the amino acid sequence encoded by SEQ ID NO: 1, 3, 4, 8, 15 or 19, (ii) an 
MHC or HLA molecule, and (iii) a pharmaceutical^ acceptable carrier. 

36. Isolated antibody which is specific for the cancer antigen of claim 10. 

37. The isolated antibody of claim 36, wherein said antibody is a monoclonal antibody. 
3 8 . Method for screening for cancer in a sample, comprising contacting said sample with 

a nucleic acid molecule which hybridizes to all or part of the molecule encoded by SEQ ID NO: 1, 
2, 3, 4, 8, 15 or 19 and determining hybridization as an indication of cancer cells in said sample. 
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39. A method for screening for cancer in a sample, comprising contacting said sample 
with the isolated antibody of claim 36, and determining binding of said antibody to a target as an 
indicator of cancer. 

40. Method for diagnosing a cancerous condition in a subj ect, comprising contacting an 
immune reactive cell containing sample of said subject to a cell line transfected with the isolated 
nucleic acid molecule of claim 1, and determining interaction of said transfected cell line with said 
immunoreactive cell, said interaction being indicative of said cancer condition. 

41. A method for determining regression, progression of onset of a cancerous condition 
comprising monitoring a sample from a patient with said cancerous condition for a parameter 
selected from the group consisting of (i) a protein encoded by SEQ ID NO: 1, 2, 3, 4, 8, 15 or 19, 
(ii) a peptide derived from said protein, (iii) cytolytic T cells specific for said peptide and an MHC 
molecule with which it non-covalently complexes, and (iv) antibodies specific for said CT protein, 
wherein amount of said parameter is indicative of progression or regression or onset of said 

cancerous condition. 

42. The method of claim 4 1 , wherein said sample is a body fluid or exudate. 

43 . The method of claim 4 1 , wherein said sample is a tissue. 

44. The method of claim 41, comprising contacting said sample with an antibody which 
specifically binds with said protein or peptide. 

45 . The method of claim 44, wherein said antibody is labelled with a radioactive label 

or an enzyme. 

46. The method of claim 44, wherein said antibody is a monoclonal antibody. 
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47. The method of claim 41, comprising amplifying RNA which codes for said protein. 

48. The method of claim 47, wherein said amplifying comprises carrying out polymerase 
chain reaction. 

49. The method of claim 40, comprising contacting said sample with a nucleic acid 
molecule which specifically hybridizes to a nucleic acid molecule which codes for or expresses said 
protein. 

50. The method of claim 49, wherein said nucleic acid molecule comprises SEQ ID NO: 

9,10,11,12, 13,14, 17 or 18. 

5 1 . The method of claim 41 , comprising assaying said sample for shed protein. 

52. The method of claim 41, comprising assaying said sample for antibodies specific for 
said protein, by contacting said sample with protein. 

53. Method for diagnosing a cancerous condition comprising assaying a sample taken 
from a subject for an immunoreactive cell specific for a peptide derived from a protein encoded by 
SEQ ED NO: 1, 2, 3, 4, 8, 15 or 19, complexed to an MHC molecule, presence of said 
immunoreactive cell being indicative of said cancerous condition. 

54. Composition comprising at least one peptide consisting of an amino acid sequence 
of from 8 to 25 amino acids concatenated to each other in the isolated cancer associated antigen of 
claim 10, and a pharmaceutical^ acceptable adjuvant. 

5 5 . The composition of claim 54, wherein said adjuvant is a saponin, a cytokine, or GM- 

CSR 

56. The composition of claim 54, comprising a plurality of MHC binding peptides. 
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57. Composition comprising an expression vector which encodes at least one peptide 
consisting of an amino acid sequence of from 8 to 25 amino acids concatenated to each other in the 
isolated cancer associated antigen of claim 10, and pharmaceutical^ acceptable adjuvant. 

58. The composition of claim 57, wherein said expression vector encodes a plurality of 
peptides. 

59. A method for screening for possible presence of apathological condition, comprising 
assaying a sample from a patient believed to have a pathological condition for antibodies specific 
to at least one of the cancer associated antigens encoded by SEQ ID NOS: 1, 2, 3, 4, 8, 15 or 19, 
presence of said antibodies being indicative of possible presence of said pathological condition. 

60. The method of claim 59, wherein said pathological condition is cancer. 

6 1 . The method of claim 59, wherein said cancer is melanoma. 

62. The method of claim 6 1 , further comprising contacting said sample to purified cancer 
associated antigen encoded by SEQ ID NO: 1, 3, 4, 8, or 15. 

63 . A method for screening for possible presence of a pathological condition in a subj ect, 
comprising assaying a sample taken from said subject for expression of anucleic acid molecule, the 
nucleotide sequence of which comprises SEQ ID NO: 1, 2, 3, 4, 8, 15 or 19, expression of said 
nucleic acid molecule being indicative of possible presence of said pathological condition. 

64. The method of claim 63, wherein said pathological condition is cancer. 

65 . The method of claim 63, comprising determining expression via polymerase chain 
reaction. 
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66. The method of claim 63, comprising determining expression by contacting said 
sample with at least one of SEQ ID NO: 9, 10, 11, 12, 13 or 14. 

67. A method for determining regression, progression of onset of a cancerous condition 
comprising monitoring a sample from a patient with said cancerous condition for a parameter 
selected from the group consisting of (i) a cancer associated antigen encoded by SEQ ID NO: 1, 2, 
3, 4, 8, 15 or 19 (ii) a peptide derived from said cancer associated antigen, (iii) cytolytic T cells 
specific for said peptide and an MHC molecule with which it non-covalently complexes, and (iv) 
antibodies specific for said cancer associated antigen, wherein amount of said parameter is indicative 
of progression or regression or onset of said cancerous condition. 

68. The method of claim 67, wherein said sample is a body fluid or exudate. 

69. The method of claim 67, wherein said sample is a tissue. 

70. The method of claim 67, comprising contacting said sample with an antibody which 
specifically binds with said protein or peptide. 

71. The method of claim 70, wherein said antibody is labelled with a radioactive label 

or an enzyme. 

72. The method of claim 70, wherein said antibody is a monoclonal antibody. 

73 . The method of claim 67, comprising amplifying RNA which codes for said protein. 

74. The method of claim 73 , wherein said amplifying comprises carrying out polymerase 
chain reaction. 
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75. The method of claim 67, comprising contacting said sample with a nucleic acid 
molecule which specifically hybridizes to a nucleic acid molecule which codes for or expresses said 
protein. 

76. The method of claim 67, comprising assaying said sample for shed cancer associated 
antigen. 

77. The method of claim 67, comprising assaying said sample for antibodies specific for 
said cancer associated antigen, by contacting said sample with said cancer associated antigen. 

78. Method for screening for a cancerous condition comprising assaying a sample taken 
from a subject for an immunoreactive cell specific for a peptide derived from a cancer associated 
antigen encoded by SEQ ID NO: 1, 2, 3, 4, 8, 15 or 19 complexed to an MHC molecule, presence 
of said immunoreactive cell being indicative of said cancerous condition. 

79. An isolated nucleic acid molecule consisting of a nucleotide sequence defined by 

SEQ ID NO: 1, 2, 3, 8, 15 or 19. 

80. Isolated nucleic acid molecule the complimentary sequence of which hydridizes, 
under stringent conditions, to the nucleotide sequence set forth in SEQ ID NO: 4, 5, 8, 15 or 19. 
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ABSTRACT OF THE DISCLOSURE 



The invention relates to newly identified cancer associated antigens. It has been discovered 
that each of these molecules provokes antibodies when expressed by a subject. The ramifications 
of this observation are also apart of this invention. 
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gagacagacg 


gggcgcagaa 


gcgguyyd uy 


agecaggage 


tyy y uyaL-ya 


y dciy ct Luuciy 


aaccgcacgc 


ggcaggtgga 


Cayccdty uy 


gacacagtgg 


geaacagegg 


caaggc uggo 


cagtctgaca 


agcccaacag 


caagcgctca 


aacgcgtcca 


gcaaccacga 


ccacgacgac 


gccaagacct 


ccaagaagaa 


gaagcgctcc 


gccgacctcc 


ccatcgaccc 


caacgaaccc 


ggggagatga 


teggctgega 


t-aauyaoyay 


gtggggctca 


atcataaacc 


Ccidy y yoddy 


gagaagacca 


tggacaaagc 


cctggagaaa 


-f- 4- 4— r"r 4- /*T/~T 3 r~* 3 


yyoyL-^uyy l. 


y uyciyyctyyci 


gcctttgttg 


aggtgeaagg 


agtgtaaaat 


ccattccttt 


catagggatg 


gcagtgattc 


gtaacaagaa 


agtggtctgt 


ggatcagcat 


aca 







ggggagatga teggctgega caacgacgag 1140 
gtggggctca atcataaacc caagggcaag 1200 
gagaagacca tggacaaagc cctggagaaa 1260 
tttgtggaca ggcgcctggt gtgaggagga 1320 
gcctttgttg aggtgeaagg agtgtaaaat 1380 
ccattccttt catagggatg gcagtgattc 1440 
gtaacaagaa agtggtctgt ggatcagcat 1500 
aca 1533 



aeggegaega gaattcccag atatagcagt 60 
eggggctgea gttcggaccg cctcccgcga 120 
geatctttge tgacccgagg gtggggccgc 180 
ctagacgagt getacgageg cttcagtcgc 240 
ctgcactgtg tgcagcgcgc gctgatccgc 300 
ategtgagee agatggtgga gctggtggag 3 60 
gagctgttcg aggegcagea ggagctgggc 420 
geggacagge ecaatggega tgcggtagcg 4 80 
cggcggcagc gcaacaacga gaaccgtgag 540 
ggcgcctcgg gcacacccaa ggagaagaag 600 
aaggecaagg eggagegaga ggcgtcccct 660 
acgtactgtc tgtgcaacca ggtctcctat 720 
tgccccatcg agtggttcca cttctcgtgc 780 
tggtactgtc ccaagtgccg gggggagaac 840 
tccaaaaaag agagggctta caacaggtag 900 
caaaataaac cgtgtattta ttacattget 960 
gtatattttt aaagaatgtt agaaaaggaa 1020 
tgtttgcctt ttgttttcat tggtacacgt 1080 
tttagaaact acaaatatag gtttgattca 1140 

1143 



<210> 3 
<211> 742 
<212> DNA 



<213> Homo sapiens 

<220> 

<400> 3 



cgccgtccac 


accccagcgg 


ccctgacgct 


gtcccctccg 


cgaccctcgc 


ctctggaaaa 


60 


agtgacaggc 


aaggccacgc 


ccccgcgagg 


gccggcctcg 


agcccgcagc 


ccccagggcc 


120 


tgggacgaga 


tcctgaagga 


gctagacgag 


tgctacgagc 


gcttcagtcg 


cgagacagac 


180 


ggggcgcaga 


agcggcggat 


gctgcactgt 


gtgcagcgcg 


cgctgatccg 


cagccaggag 


240 


ctgggcgacg 


agaagatcca 


gatcgtgagc 


cagatggtgg 


agctggtgga 


gaaccgcacg 


300 


cggcaggtgg 


acagccacgt 


ggagctgttc 


gaggcgcagc 


aggagctggg 


cgacacagcg 


360 


ggcaacagcg 


gcaaggctgg 


cgcggacagg 


cccaaaggcg 


aggcggcagc 


gcaggctgac 


420 


aagcccaaca 


gcaagcgctc 


acggcggcag 


cgcaacaacg 


agaaccgtga 


gaacgcgtcc 


480 


agcaaccacg 


accacgacga 


cggcgcctcg 


ggcacaccca 


aggagaagaa 


ggccaagacc 


540 


tccaagaaga 


agaagcgctc 


caaggccaag 


gcggagcgag 


aggcgtcccc 


tgccgacctc 


600 


cccatcgacc 


ccaacgaacc 


cacgtactgt 


ctgtgcaacc 


aggtctccta 


tggggagatg 


660 


atcggctgcg 


acaacgacga 


gtgccccatc 


gagtggttcc 


acttctcgtg 


cgtggggctc 


720 


aatcataaac 


ccaagggcaa 


gt 








742 


<210> 4 














<211> bo / 














<212> DNA 














<Zio> Homo 


sapiens 












<220> 














<400> 4 












60 


cctccgagaa 


cggtgtccat 


ggcacagggc 


gggaagagat 


aaggcctagg 


gaaggcgccc 


ctcgggccta 


tccacctctt 


ctggggctcg 


gcactaggaa 


gcagcttccc 


tctcaggccc 


120 


ctttgtctcc 


aagccgttcc 


aaactgagta 


ccgggagacg 


acacaaaggg 


agggcggtga 


180 


cggatggcgc 


aggcgcggga 


gccgcctagg 


ctgctgggag 


tggtggtccg 


gccgcggaat 


240 


ggagatcctg 


aaggagctag 


acgagtgcta 


cgagcgcttc 


agtcgcgaga 


cagacggggc 


300 


gcagaagcgg 


cggatgctgc 


actgtgtgca 


gcgcgcgctg 


atccgcagcc 


aggagctggg 


360 


cgacgagaag 


atccagatcg 


tgagccagat 


ggtggagctg 


gtggagaacc 


gcacgcggca 


420 


ggtggacagc 


cacgtggagc 


tgttcgaggc 


gcagcaggag 


ctgggcgaca 


cagcgggcaa 


480 


cagcggcaag 


gctggcgcgg 


acaggcccaa 


aggcgaggcg 


gcagcgcagg 


ctgacaagcc 


540 


caacagcaag 


cgctcacggc 


ggcagcgcaa 


caacgagaac 


cgtgagaacg 


cgtccagcaa 


600 


ccacgaccac 


gacgacggcg 


cctcgggcac 


acccaaggag 


aagaaggcca 


agacctccaa 


660 


gaagaagaag 


cgctccaagg 


ccaaggcgga 


gcgagaggcg 


tcccctgccg 


acctccccat 


720 


cgaccccaac 


gaacccacgt 


actgtctgtg 


caaccaggtc 


tcctatgggg 


agatgatcgg 


780 


ctgcgacaac 


gacgagtgcc 


ccatcgagtg 


gttccacttc 


tcgtgcgtgg 


ggctcaatca 


840 


taaacccaag 


ggcaagt 










857 



<210> 5 
<211> 279 
<212> PRT 

<213> Homo sapiens 

<220> 

<400> 5 

Met Leu Ser Pro Ala Asn Gly Glu Gin Leu His Leu Val Asn Tyr Val 
15 10 15 

Glu Asp Tyr Leu Asp Ser lie Glu Ser Leu Pro Phe Asp Leu Gin Arg 
20 25 30 

Asn Val Ser Leu Met Arg Glu lie Asp Ala Lys Tyr Gin Glu lie Leu 
35 40 45 

Lys Glu Leu Asp Glu Cys Tyr Glu Arg Phe Ser Arg Glu Thr Asp Gly 
50 55 60 

Ala Gin Lys Arg Arg Met Leu His Cys Val Gin Arg Ala Leu lie Arg 
65 70 75 80 

Ser Gin Glu Leu Gly Asp Glu Lys lie Gin lie Val Ser Gin Met Val 
85 90 95 

Glu Leu Val Glu Asn Arg Thr Arg Gin Val Asp Ser His Val Glu Leu 
100 105 110 

Phe Glu Ala Gin Gin Glu Leu Gly Asp Thr Val Gly Asn Ser Gly Lys 
115 120 125 

Val Gly Ala Asp Arg Pro Asn Gly Asp Ala Val Ala Gin Ser Asp Lys 
130 135 140 

Pro Asn Ser Lys Arg Ser Arg Arg Gin Arg Asn Asn Glu Asn Arg Glu 
145 150 155 160 

Asn Ala Ser Ser Asn His Asp His Asp Asp Gly Ala Ser Gly Thr Pro 
165 170 175 

Lys Glu Lys Lys Ala Lys Thr Ser Lys Lys Lys Lys Arg Ser Lys Ala 
180 185 190 

Lys Ala Glu Arg Glu Ala Ser Pro Ala Asp Leu Pro He Asp Pro Asn 
195 200 205 

Glu Pro Thr Tyr Cys Leu Cys Asn Gin Val Ser Tyr Gly Glu Met He 
210 215 220 

Gly Cys Asp Asn Asp Glu Cys Pro He Glu Trp Phe His Phe Ser Cys 
225 230 235 240 

Val Gly Leu Asn His Lys Pro Lys Gly Lys Trp Tyr Cys Pro Lys Cys 
245 250 255 

Arg Gly Glu Asn Glu Lys Thr Met Asp Lys Ala Leu Glu Lys Ser Lys 
260 265 270 

Lys Glu Arg Ala Tyr Asn Arg 
275 



<210> 6 
<211> 210 
<212> PRT 

<213> Homo sapiens 

<220> 

<400> 6 



Met Leu His Cys Val Gin Arg Ala Leu lie Arg Ser Gin Glu Leu Gly 
15 10 15 

Asp Glu Lys lie Gin He Val Ser Gin Met Val Glu Leu Val Glu Asn 
20 25 30 

Arg Thr Arg Gin Val Asp Ser His Val Glu Leu Phe Glu Ala Gin Gin 
35 40 45 

Glu Leu Gly Asp Thr Val Gly Asn Ser Gly Lys Val Gly Ala Asp Arg 
50 55 60 

Pro Asn Gly Asp Ala Val Ala Gin Ser Asp Lys Pro Asn Ser Lys Arg 
65 70 75 80 

Ser Arg Arg Gin Arg Asn Asn Glu Asn Arg Glu Asn Ala Ser Ser Asn 
85 90 95 

His Asp His Asp Asp Gly Ala Ser Gly Thr Pro Lys Glu Lys Lys Ala 
100 105 110 

Lys Thr Ser Lys Lys Lys Lys Arg Ser Lys Ala Lys Ala Glu Arg Glu 
115 120 125 

Ala Ser Pro Ala Asp Leu Pro He Asp Pro Asn Glu Pro Thr Tyr Cys 
130 135 140 

Leu Cys Asn Gin Val Ser Tyr Gly Glu Met He Gly Cys Asp Asn Asp 
145 150 155 160 

Glu Cys Pro He Glu Trp Phe His Phe Ser Cys Val Gly Leu Asn His 
165 170 175 

Lys Pro Lys Gly Lys Trp Tyr Cys Pro Lys Cys Arg Gly Glu Asn Glu 
180 185 190 

Lys Thr Met Asp Lys Ala Leu Glu Lys Ser Lys Lys Glu Arg Ala Tyr 
195 200 205 



Asn Arg 
210 



<210> 7 
<211> 235 
<212> PRT 

<213> Homo sapiens 

<220> 

<400> 7 

Met Glu He Leu Lys Glu Leu Asp Glu Cys Tyr Glu Arg Phe Ser Arg 
15 10 15 

Glu Thr Asp Gly Ala Gin Lys Arg Arg Met Leu His Cys Val Gin Arg 
20 25 30 

Ala Leu He Arg Ser Gin Glu Leu Gly Asp Glu Lys He Gin He Val 
35 40 45 

Ser Gin Met Val Glu Leu Val Glu Asn Arg Thr Arg Gin Val Asp Ser 
50 55 60 

His Val Glu Leu Phe Glu Ala Gin Gin Glu Leu Gly Asp Thr Val Gly 
65 70 75 80 



Asn Ser Gly Lys Val Gly Ala Asp Arg Pro Asn Gly Asp Ala Val Ala 
85 90 95 



Gin Ser Asp Lys Pro Asn Ser Lys Arg Ser Arg Arg Gin Arg Asn Asn 
100 105 110 

Glu Asn Arg Glu Asn Ala Ser Ser Asn His Asp His Asp Asp Gly Ala 
115 120 125 

Ser Gly Thr Pro Lys Glu Lys Lys Ala Lys Thr Ser Lys Lys Lys Lys 
130 135 140 

Arg Ser Lys Ala Lys Ala Glu Arg Glu Ala Ser Pro Ala Asp Leu Pro 
145 150 155 160 

He Asp Pro Asn Glu Pro Thr Tyr Cys Leu Cys Asn Gin Val Ser Tyr 
165 170 175 

Gly Glu Met He Gly Cys Asp Asn Asp Glu Cys Pro He Glu Trp Phe 
180 185 190 

His Phe Ser Cys Val Gly Leu Asn His Lys Pro Lys Gly Lys Trp Tyr 
195 200 205 

Cys Pro Lys Cys Arg Gly Glu Asn Glu Lys Thr Met Asp Lys Ala Leu 
210 215 220 

Glu Lys Ser Lys Lys Glu Arg Ala Tyr Asn Arg 
225 230 235 



<210> 8 
<211> 772 
<212> DNA 
<213> Homo sapiens 
<220> 
<221> CDS 
<222> 695,714 
<400> 8 



aaagcgttct 


cggcggcagc 


gcaacaacta 


gaaccgtgag 


aacgcgtcca 


gcaaccgcga 


60 


cccacgacga 


cgtcacctcg 


ggcacgccca 


aggagaagaa 


agcccagacc 


tctaagaaga 


120 


agcagggctc 


catggccaag 


gcgtagcggc 


aggcgtcccc 


cgcagacctc 


cccatcgacc 


180 


ccagcgagcc 


ctcctactgg 


gagatgatcc 


gctgcgacaa 


cgaatgcccc 


atcgagtggt 


240 


tccgcttctc 


gtgtgtgagt 


ctcaaccata 


aaccaaagcg 


caagtggtac 


tgttccagat 


300 


gccggggaaa 


gaacgatggg 


caaagccctt 


gagaagtcca 


gaaaaaaaac 


agggcttata 


360 


acaggtagtt 


tggggacatg 


cgtctaatag 


tgaggagaac 


aaaataagcc 


agtgtgttga 


420 


ttacattgcc 


acctttgctg 


aggtgcagga 


agtgtaaaat 


gtatattttt 


aaagaatgtt 


480 


gttagaggcc 


gggcgcggtg 


gctcacgcct 


gtaatcccag 


cactttggga 


ggccgaggcg 


540 


gtcggatcac 


gaggtcagga 


gatcgagacc 


atcctggcta 


acacggtgaa 


accccgtctc 


600 


tactaaaaat 


tcaaaaaaaa 


aattagctgg 


gcgtggtggc 


gggcgcctgt 


agtcccagct 


660 


attcgggagg 


ctgaggcagg 


agaatggcnt 


gaacctggga 


ggtggagctt 


gcantgagcc 


720 


aaggtcgcgc 


cactgcactc 


cagcctgggc 


gacagagcga 


gactccatct 


ta 


772 



<210> 9 

<211> 32 

<212> DNA 

<213> Homo sapiens 



<220> 
<400> 9 

cacacaggat ccatgttgag tcctgccaac gg 32 



<210> 10 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 10 

cgtggtcgtg gttgctggac gcg 23 



<210> 11 

<211> 21 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 11 

cccagcggcc ctgacgctgt c 21 



<210> 12 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 12 

cgtggtcgtg gttgctggac gcg 23 



<210> 13 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 13 

ggaagagata aggcctaggg aag 23 



<210> 14 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 14 

cgtggtcgtg gttgctggac gcg 23 



<210> 15 

<211> 2030 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> 1628, 1752, 1758, 1769, 1789, 1873, 1908, 1915, 1933, 1970, 1976, 2022 
<400> 15 



ctcgtgccgt 


taaagatggt 


cttctgaagg 


ctaactgcgg 


aatgaaagtt 


tctattccaa 


60 


ctaaagcctt 


agaattgatg 


gacatgcaaa 


ctttcaaagc 


agagcctccc 


gagaagccat 


120 


ctgccttcga 


gcctgccatt 


gaaatgcaaa 


agtctgttcc 


aaataaagcc 


ttggaattga 


180 


agaatgaaca 


aacattgaga 


gcagatgaga 


tactcccatc 


agaatccaaa 


caaaaggact 


240 


atgaagaaag 


ttcttgggat 


tctgagagtc 


tctgtgagac 


tgtttcacag 


aaggatgtgt 


300 



rr1~'f~t~3r'rT'33 

y U L L d L*L* L^dd 


y y \ — i_ o. ' — ci ^ — ci l. 


caaaaacraaa 

V—* t-J* C-J- 1*. y 


tagataaaat 


aaatggaaaa 


ttagaagagt 


360 


L. luu Ly a Laa 


t crataatttt 

Ly a i.y y l l l (— 


ct" craacrcrctc 


ectgeagaat 


gaaagtttct 


attccaacta 


420 


aayLL L Lay u 


3i~lTT3l - crrr3r' 
cii_ Lyu uy y uL/ 


a. t go 3. 3. cl c 1 1 


tcaaagcaga 


gcctcccgag 


aagccatctg 


480 


l-l^ L LL>y dy l^ 


l y L-d u l y dd 


si - crcaasacrt 

Ci \— d t^i ^ 


ctgttccaaa 


taaagccttg 


gaattgaaga 


540 


d LyddL*dddo 


a tt era csr crca 

CL L- L- M d y d y w d 


gatcagatgt 


tcccttcaga 


atcaaaacaa 


aagaaggttg 


600 


aa 9*s 3.3 a t 1 c 


ttgggattct 


gagagtctcc 


atgaqactat 


ttcacagaag 


gatgtgtgtg 


660 


tacccaaggc 


tacacatcaa 


aaagaaatgg 


ataaaataag 


tggaaaatta 


gaagattcaa 


720 


r , 1~3rrr'r't"3lT' 

I CdM LO 1— Q L w 


a aaaatcttcr 

d dd dd L.U ' ^— M 


gatacagttc 


attcttgtga 


aagagcaagg 


gaacttcaaa 


780 


ddy d do uy 


l y d dL. ddL«y l 


3 racrcraaaaa 

d L uy V4 d d d dd 


tggaacaaat 


gaaaaagaag 


ttttgtgtac 


840 


4— /*y ^ ^ ^ n ^ /"y ^ 

igaaaaaydd 


a f" ,, t~n"l~r"" , 3n i 33 
Ly LOdy a. a. 


nT* 3 3 3 3 (T3 33 
y v cicicici.ya.CT.ci. 


t" 3 aaal" caca 


gttagagaac 


caaaaagtta 


900 


d 4— ^ ^ ^ 

aaigg g aaLa 


dy dy l- Lyu 


a nt" rrl - rr3 rra t" 

Qy l~M Lyuy u l 


tgactttaaa 


ccaagaagaa 


gagaagagaa 


960 


-n 4— /-» r*r o 

yaaatyCCya 


i-?»-h;=ii-t-333l~ 
LdLaL Laaa L 


rr333333t"f~3 
y dddddd i— i_d 


acrcrsacraatt 

uyu uuy uti u u 


aggaagaatc 


gaagagcagc 


1020 


ataggaaaga 


y LLdyctciy Ly 


;=l^;i^■33^ , 33^ , 


i~ f rra 3 (~"3 nrrp 


tctcagaata 


caagatatag 


1080 




Ly Layaaciy l 




aggttt ctca 


cactcatgaa 


aatgaaaatt 


1140 




■4— — i -j a a *H +~ n 
tyaaaaLiyL 


d Ly l l. y d d dd 


3 rrrr3 3 3 1~ 1~ nc 


catget aaaa 


ctggaaatag 


1200 


ccacactgaa 


acaccaatac 


caggaaaay y 


dddd Lctdd Ld 


L/ l l l y d y y d l- 


a i""t" a a ct a 1 1 1 

d l d d. M d ^ U- 


1260 


taaaagaaaa 


gaatgctgaa 


o L Ltdya uy d 


LLO Lauaut L 


rr3 3 acfacfQaa 

ydC3.ciya.yyw.w. 


tcattaacta 


1320 


aaagggcatc 


tcaatatagt 


gggcagctta 


ddy LLLLyaL 


ay l uyayaciL 


3r~ , 33'}~fTf" , 't~r , 3 


1380 


cttctaaatt 


gaaggaaaaa 


Cday dLdady 


aaauaLLaya 


rrn'r , 3n'333"t - 1" 
y y ■ ay cici ci i_ l. 


gaatcacacc 


1440 


atcctagact 


ggcttctget 


or+* -3 a /T a r" 1 

Cf LaCadgaLU 


aiyalLduaL 


i~ rrt* rT3 r^f ra 

Ly uy ULG LLCl 


acraaaaacrt c 

uy dddddy \— *w 


1500 


aagaacctgc 


tttccacatt 


gcaggagatg 


4- 4- r-f -|— 4- -t-/-f/-i'a 

L-LLy LtLyod 


3 3 rf 3 3 3 3 3 "f~ (T 
day aaacia y 


aatgttgatg 


1560 


tgagtagtac 


cgatatataa 


caa ugagg uy 


L« LLLaLLaaL 


rpptt't 1 r , +"n , 3 


acrctcaaacrcr 

uy w v_^dd.dy y 


1620 


aaatccanaa 


gectaaaaat 


taatctcaat 


T,augcaggag 


dLyLLL Ldciy 


3rr3333i~3r , 3 


1680 


ttggtttcag 


gaacatgeae 


aaagagacca 


dLy ty ddaLct 


r*3 rrl - rri~ r"3 3 3 
Lay l y l> ociu 


tgaaggaagc 


1740 


tgaacacatg 


l. nr. c a a anc g 


daLaaya Liici 


Ly Ly GCtLQCia. 


cacact ganc 


agcaggagtc 


1800 


tctagatcag 


aaattatttc 


aactacaaag 


caaaaatatg 


tggcttcaac 


agcaattagt 


1860 


tcatgcacat 


aangaaagct 


gacaacaaaa 


gcaagataac 


aattgatntt 


cattntcttg 


1920 


agaggaaaat 


gcncatcatc 


ttctaaaaga 


gaaaaatgag 


gagatatttn 


attacnataa 


1980 


ccatttaaaa 


aaccegtata 


tttcaatatg 


gaaaaaaaaa 


anaaaaaaaa 




2030 



<210> 16 
<211> 528 
<212> PRT 

<213> Homo sapiens 

<220> 

<400> 16 

Met Lys Val Ser lie Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gin 
15 10 15 



Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala 
20 25 30 



lie Glu Met Gin Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn 
35 40 45 

Glu Gin Thr Leu Arg Ala Asp Glu lie Leu Pro Ser Glu Ser Lys Gin 
50 55 60 

Lys Asp Tyr Glu Glu Ser Ser Trp Asp Ser Glu Ser Leu Cys Glu Thr 
65 70 75 80 

Val Ser Gin Lys Asp Val Cys Leu Pro Lys Ala Thr His Gin Lys Glu 
85 90 95 

He Asp Lys He Asn Gly Lys Leu Glu Glu Ser Pro Asp Asn Asp Gly 
100 105 110 

Phe Leu Lys Ala Pro Cys Arg Met Lys Val Ser He Pro Thr Lys Ala 
115 120 125 

Leu Glu Leu Met Asp Met Gin Thr Phe Lys Ala Glu Pro Pro Glu Lys 
130 135 140 

Pro Ser Ala Phe Glu Pro Ala He Glu Met Gin Lys Ser Val Pro Asn 
145 150 155 160 

Lys Ala Leu Glu Leu Lys Asn Glu Gin Thr Leu Arg Ala Asp Gin Met 
165 170 175 

Phe Pro Ser Glu Ser Lys Gin Lys Lys Val Glu Glu Asn Ser Trp Asp 
180 185 190 

Ser Glu Ser Leu Arg Glu Thr Val Ser Gin Lys Asp Val Cys Val Pro 
195 200 205 

Lys Ala Thr His Gin Lys Glu Met Asp Lys He Ser Gly Lys Leu Glu 
210 215 220 

Asp Ser Thr Ser Leu Ser Lys He Leu Asp Thr Val His Ser Cys Glu 
225 230 235 240 

Arg Ala Arg Glu Leu Gin Lys Asp His Cys Glu Gin Arg Thr Gly Lys 
260 265 270 

Met Glu Gin Met Lys Lys Lys Phe Cys Val Leu Lys Lys Lys Leu Ser 
275 280 285 

Glu Ala Lys Glu lie Lys Ser Gin Leu Glu Asn Gin Lys Val Lys Trp 
290 295 300 

Glu Gin Glu Leu Cys Ser Val Arg Leu Thr Leu Asn Gin Glu Glu Glu 
305 310 315 320 

Lys Arg Arg Asn Ala Asp He Leu Asn Glu Lys He Arg Glu Glu Leu 
325 330 335 

Gly Arg He Glu Glu Gin His Arg Lys Glu Leu Glu Val Lys Gin Gin 
340 345 350 

Leu Glu Gin Ala Leu Arg He Gin Asp He Glu Leu Lys Ser Val Glu 
355 360 365 

Ser Asn Leu Asn Gin Val Ser His Thr His Glu Asn Glu Asn Tyr Leu 
370 375 380 

Leu His Glu Asn Cys Met Leu Lys Lys Glu He Ala Met Leu Lys Leu 



385 



390 



395 



400 



Glu lie Ala Thr Leu Lys His Gin 
405 

Phe Glu Asp lie Lys He Leu Lys 
420 

Thr Leu Lys Leu Lys Glu Glu Ser 
435 440 

Ser Gly Gin Leu Lys Val Leu He 
450 455 

Lys Leu Lys Glu Lys Gin Asp Lys 
465 470 

Ser His His Pro Arg Leu Ala Ser 
485 

Val Thr Ser Arg Lys Ser Gin Glu 
500 

Ala Cys Leu Gin Arg Lys Met Asn 
515 520 



Tyr Gin Glu Lys Glu Asn Lys Tyr 
410 415 

Glu Lys Asn Ala Glu Leu Gin Met 
425 430 

Leu Thr Lys Arg Ala Ser Gin Tyr 
445 

Ala Glu Asn Thr Met Leu Thr Ser 
460 

Glu He Leu Glu Ala Glu He Glu 

475 480 

Ala Val Gin Asp His Asp Gin He 
490 495 

Pro Ala Phe His He Ala Gly Asp 
505 510 

Val Asp Val Ser Ser Thr Asp He 
525 



<210> 17 

<211> 33 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 17 

cacacaggat ccatgcaggc cccgcacaag gag 33 



<210> 18 

<211> 34 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 18 

cacacaaagc ttctaggatt tggcacagcc agag 34 



<210> 19 
<211> 294 
<212> PRT 

<213> Homo sapiens 

<220> 

<400> 19 

Met Pro Leu Cys Thr Ala Thr Arg He Pro Arg Tyr Ser Ser Ser Ser 
15 10 15 

Asp Pro Gly Pro Val Ala Arg Gly Arg Gly Cys Ser Ser Asp Arg Leu 
20 25 30 

Pro Arg Pro Ala Gly Pro Ala Arg Arg Gin Phe Gin Ala Ala Ser Leu 
35 40 45 

Leu Thr Arg Gly Trp Gly Arg Ala Trp Pro Trp Lys Gin He Leu Lys 
50 55 60 

Glu Leu Asp Glu Cys Tyr Glu Arg Phe Ser Arg Glu Thr Asp Gly Ala 
65 70 75 80 

Gin Lys Arg Arg Met Leu His Cys Val Gin Arg Ala Leu He Arg Ser 



85 



90 



95 



Gin Glu Leu Gly Asp Glu Lys He Gin He Val Ser Gin Met Val Glu 
100 105 110 

Leu Val Glu Asn Arg Thr Arg Gin Val Asp Ser His Val Glu Leu Phe 
115 120 125 

Glu Ala Gin Gin Glu Leu Gly Asp Thr Val Gly Asn Ser Gly Lys Val 
130 135 140 

Gly Ala Asp Arg Pro Asn Gly Asp Ala Val Ala Gin Ser Asp Lys Pro 
145 150 155 160 

Asn Ser Lys Arg Ser Arg Arg Gin Arg Asn Asn Glu Asn Arg Glu Asn 
165 170 175 

Ala Ser Ser Asn His Asp His Asp Asp Gly Ala Ser Gly Thr Pro Lys 
180 185 190 

Glu Lys Lys Ala Lys Thr Ser Lys Lys Lys Lys Arg Ser Lys Ala Lys 
195 200 205 

Ala Glu Arg Glu Ala Ser Pro Ala Asp Leu Pro He Asp Pro Asn Glu 
210 215 220 

Pro Thr Tyr Cys Leu Cys Asn Gin Val Ser Tyr Gly Glu Met He Gly 
225 230 235 240 

Cys Asp Asn Asp Glu Cys Pro lie Glu Trp Phe His Phe Ser Cys Val 
245 250 255 

Gly Leu Asn His Lys Pro Lys Gly Lys Trp Tyr Cys Pro Lys Cys Arg 
260 265 270 

Gly Glu Asn Glu Lys Thr Met Asp Lys Ala Leu Glu Lys Ser Lys Lys 
275 280 285 

Glu Arg Ala Tyr Asn Arg 
290 294 



<210> 20 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 20 

caagcagag cctcccgaga ag 22 

<210> 21 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 21 

cctatgctgc tcttcgattc ttcc 24 



<210> 22 

<211> 4095 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 22 

ctagtctata cagcaacgac cctacatcgt cactctgggg tcttagaaag tccataaagc 



tgcctcccgg 


gacaagtccg 


aagctggaga 


at Fit riCrLrlGel 
la l. la uouuay o 


crpcpacraacra 

y V— ' *w w la y la la y la 


gaetgetcta 
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yy lqu Quay a 


a rrt rrpp ^ rrrt" 

y y y 
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y a lu UU.M y \j u. 


t" P a 3 t ClCCFh 

L- Gt LrO, LA L*- y w L-*LA 


caggaggctt 


1" aaatct cert 

<— LA LA LA L- V* L~ ^ y 


gatgtg ta tg 


caacatggct 


gtggtggcaa 


actgc tg tec 


atggtgcagt 


acttttacta 


ccataacgaa 


agaagtgagc 


a 1" cr p era a 1" cr c 


gt taataagt 


taaatgeaca 


rrp rrp t" ^5 rrt t rr 


catciT'ttctt 


agcaaaatgt 


L. y ^ Ciy CICIO CL L 


a t err* t crt t a c 


tgtggatttc 


rr^ ^ a attatp 

y dCl a<X L. L la L O 


aaaaa tratc 

CA LA LA LA LA L>L^LA L» 


aaataccaa t 


rrptrrppjpppt 


rrrrrrrrra a a era 

y y o y y act CAy la 


cacctgacac 


rfrrp't' rrpp. ppp 

UUU LU ^ Ca OOO 


t rrrrt rrrr^ a a rr 

uy y L-yycACACAy 


acacctgaca 


dy y o LyLa lu 


l Ly y Lyy ayy 


a^patptcrac 

LALa^Ca L- y U.L* 


aay l Lccjaac 


y LOdyodydd 


ctctctoctoo Lciy 


gaaatttacg 


ggccagcaaa 


y y ddy ctoo Ld 


ctagggaaat 


a tgay tcccg 


a aaa/raaa/'-'a: 
ddddy dddOd 


cct aggaaga 


oyodLyyydy 


a a a a a frp. P. ^ P 

dddddyddcio 


a Ll Ldd Lddd 


O L-CtCtCty L L L L 


y CACACiciCAy y laca 


La, LL La. Octet a. 


yoctcty ty llu 


t rrat pacracrcr 

L.yCA l. v^CAy la y y 


yaaLdLLLLL 


l y d l lo Loyy 


rrtpt pt 1 1 rra 

y LOLOLLLyCA 


/-< +■ a +- a +- -J 4- 

gLLLaLaLal. 


addday LddL 


ydyd l ddd Ld 


ccttcaagcc 


gecattgaaa 


gcaaaac Co L. 


caaacattga 


agcagatccg 


tgt tcccacc 


ttgggattct 


agagtctctg 


gagactgttt 


atcaaaaaga 


atagataaaa 


a a a +" /"f/T a a a a 

aaauy y aaaa 


aaggctacct 


/"Y a a "t~ a a 3 

Ly y aa Ly ctcict 


LLLOLdLLOO 


tt teaaageg 


agcctccggg 


a -3 a "i - i - "" "f~ it 

ddyoodLOLy 


tcccaaataa 


gccttggaat 


✓raaaaa"H("Yaa 

gaaaaa ty ad 


gaatccaaac 


aaaggactat 


aagaaaattc 


acagaaggat 


tgtgtttacc 


aaggctgege 


tagaagggtc 


cctgttaaag 


tggtcttctg 


actaaagect 


agaattgatg 


acatgeaaac 


cttcgagcct 


ccattgaaat 


caaaagtctg 



gatgacaaag 


ggaagaagac 


atcaacctta 


120 


actcrtrcrcctcr 


crtcaatcrcrcc 


tcracrcraacrta 


180 


v-j u. o w ^ l. o y 


1~crcrro"3 Picric 


rrcrar , 3r , p1"pt' 


240 


tgcaaatatt 


tgatagattc 


ggtgccgata 


300 


tecattatge 


gtttatagtg 


gattttgtca 


360 


atcgaagtgc 


caacaaggct 


gcctcacacc 


420 


aattgtggaa 


ttttgetgat 


aaaaatgcaa 


480 
540 


ccctcatgct 


gctgtatgtc 


tggatcatca 


600 


gaegtet t tg 


tgcagatata 


gtggagtaac 


660 


tcacattcat 


aacaaattat 


gaatatatac 


720 


patraacrcraac 

v^LAy uuy y la la 


tetgeaggaa 


acctgatgag 


780 


rrptn i aaacrr ,, l~ 

y w L^y LACALAy l* 


ggtggaaaaa 


cacctgatga 


840 
900 


ggctgaaagc 


tggtggaaaa 


acacctgatg 


dddL LOdd Ly 


1 1 CfCIPi CTFi aflfl 

yyy y 


crapatctcrcra 

y la l^-la ' l^ y y ca 




ydddLLdoyd 


loo Lyodddd 


caciciola lo l- y ca 


960 


yddy dLOyOd 


yyydyddddd 


yddydododo 


1020 


/rf - i*rarra a a ■h'h 
Ltydyddd L L 


doy Lyyyody 


ddddyyddyd 


1080 


OOLy Udddyd 


oyyduyoy Ly 


/"•a a fya a a 
ocACiycAy u- a. ci \^ 


1140 


a. lo Lddyd l y 


1 1 (TP ^ t rrt pp 
l. uyociLy *— 


acaaaagaat 


1200 


LOOOdLOdyd 


L O O LA CL CA L^-CA CAy 


crcraacratcraa 

y y GLCAy la L^y la la 


1260 


■prrttptrrP^iP, 
d y l. LOLyociLA 


p;a 1 1 cpi ricrt cr 

y la L- ^ — ' la LAy L- y 


gtatacct ga 


1320 


dyddy Ldy dd 


dyoo loo lclca 


aarrrrsfrtrf 
ci d y v^^ci y 


1380 


l tocaaa L.aa 


y oO l l Ly dd l 


yddydduyad 


14 40 


gaatccaaac 


aaaggactat 


a a /^f a a a a "l~"h r* 

daCjdddd L LO 


1 ^nn 

lOUU 


acagaaggat 


tgtgtttacc 


a a rf /~i 4— a a ^ 

aay gc tacat 


1 JuU 


nagaagag lo 


OO Lad Udddy 


Lyy LOLLOLy 


1620 


dCIaaayCLL 


dyad l uyddy 


dod Ly odddo 


1680 


4— 4— y — i -7, « « 4J— 

OLLoydyOO L 


OOclO Ly add L 


V^CLCALACLy L. ^ l^ y 


1740 


aaacatggag 


gcay a oy ay a 


do LOOOdLOd 


1800 


tgggatactg 


gagtctctgt 


dydL Ly L L LO 


lOOU 


tcaaaaagaa 


tagataaaat 


aatggaaaat 


1920 


aggctaactg 


ggaatgaaag 


ttctattcca 


1980 


ttcaaagcag 


gcctcccgag 


agccatctgc 


2040 


tccaaataaa 


ccttggaatt 


aagaatgaac 


2100 



aaacattgag 


gcagatgaga 


actcccatca 


tgggattctg 


gagtctctgt 


agactgtttc 


tcaaaaagaa 


tagataaaat 


aatggaaaat 


aggctccctg 


agaatgaaag 


ttctattcca 


ttcaaagcag 


gcctcccgag 


agccatctgc 


tccaaataaa 


ccttggaatt 


aagaatgaac 


aatcaaaaca 


aagaaggttg 


agaaaattct 


cagaaggatg 


gtgtgtaccc 


aggctacaca 


agaagattca 


ctagcctatc 


aaaatcttgg 


ttcaaaaaga 


cactgtgaac 


acgtacagga 


ctgaaaaaga 


actgtcagaa 


caaaagaaat 


ggaacaagag 


tctgcagtgt 


agattgactt 


atatattaaa 


gaaaaaatta 


ggaagaatta 


gaagtgaaac 


acaacttgaa 


aggctctcag 


taatttgaat 


aggtttctca 


actcatgaaa 


tgaaaaagga 


attgccatgc 


aaaactggaa 


gaaaataaat 


ctttgaggac 


ttaagatttt 


aaaactgaaa 


aggaatcatt 


actaaaaggg 


tagctgagaa 


acaatgctca 


ttctaaattg 


gaaattgaat 


acaccatcct 


gactggcttc 


aagaaaaagt 


aagaacctgc 


ttccacattg 


ttgatgtgag 


agtacgatat 


taacaatgag 


aaatccaaaa 


cctaaaaatt 


atctcaatta 


ttcagaacat 


cacaaagaga 


caacgtgaaa 


atcaaaacga 


caagataatg 


gaacaaacac 


tttcaactac 


aagcaaaaat 


tgtggcttca 


caacaaaagc 


agataacaat 


gatattcatt 


aagagaaaaa 


gaggagatat 


taattacaat 


aaagagaaag 


agaaacagaa 


actcatgaga 


acagaccaga 


ctttactcac 


actcatgcta 


atcttaccaa 


agtctgtgtc 


acagaatact 


aagcctacag 


cataaaataa 


agtgtgaaga 


ggattcccat 


taccctgatg 


tgcagcagac 



aatccaaaca 


aaggactatg 


agaaagttct 


2160 


cagaaggatg 


gtgtttaccc 


aggctacaca 


2220 


agaagagtct 


ctgataatga 


ggttttctga 


2280 


ctaaagcctt 


gaattgatgg 


catgcaaact 


2340 


ttcgagcctg 


cattgaaatg 


aaaagtctgt 


2400 


aacattgaga 


cagatcagat 


ttcccttcag 


2460 


gggattctga 


agtctccgtg 


gactgtttca 


2520 


caaaaagaaa 


ggataaaata 


gtggaaaatt 


2580 


tacagttcat 


cttgtgaaag 


gcaagggaac 


2640 


aaatggaaca 


atgaaaaaga 


gttttgtgta 


2700 


aaatcacagt 


agagaaccaa 


aagttaaatg 


2760 


aaaccaagaa 


aagagaagag 


agaaatgccg 


2820 


gaagaatcga 


gagcagcata 


gaaagagtta 


2880 


atacaagata 


agaattgaag 


gtgtagaaag 


2940 


tgaaaattat 


tcttacatga 


aattgcatgt 


3000 


tagccacact 


aaacaccaat 


ccaggaaaag 


3060 


aaagaaaaga 


tgctgaactt 


agatgaccct 


3120 


atctcaatat 


gtgggcagct 


aaagttctga 


3180 


aggaaaaaca 


gacaaagaaa 


actagaggca 


3240 


gctgtacaag 


ccatgatcaa 


ttgtgacatc 


3300 


aggagatgct 


gtttgcaaag 


aaaatgaatg 


3360 


tgctccatca 


ccactttctg 


agctcaaagg 


3420 


gcaggagatg 


tctaagagaa 


atacattggt 


3480 


acagtgtcaa 


tgaaggaagc 


gaacacatgt 


3540 


ctgaacagca 


gagtctctag 


tcagaaatta 


3600 


cagcaattag 


tcatgcacat 


agaaagctga 


3660 


tcttgagagg 


aaatgcaaca 


catctcctaa 


3700 


accatttaaa 


aaccgtatat 


tcaatatgaa 


3760 


acaagcagta 


gaaacttctt 


tggagaaaca 


3820 


gaggccagtc 


tagcatcacc 


tatgttgaaa 


3880 


attttagaag 


aaaattcatg 


tttcttcctg 


3940 


ttacttgttc 


cgaattgcat 


aagctgcaca 


4000 


tcattcaatc 


aaccagaatc 


cgctctgcac 


4060 



tccagcctag tgacagagtg gactccacct ggaaa 4095 

<210> 23 
<211> 1341 
<212> PRT 

<213> Homo sapiens 

<220> 

<400> 23 

Met Thr Lys Arg Lys Lys Thr lie Asn Leu Asn lie Gin Asp Ala Gin 
15 10 15 

Lys Arg Thr Ala Leu His Trp Ala Cys Val Asn Gly His Glu Glu Val 
20 25 30 

Val Thr Phe Leu Val Asp Arg Lys Cys Gin Leu Asp Val Leu Asp Gly 
35 40 45 

Glu His Arg Thr Pro Leu Met Lys Ala Leu Gin Cys His Gin Glu Ala 
50 55 60 

Cys Ala Asn lie Leu He Asp Ser Gly Ala Asp He Asn Leu Val Asp 
65 70 75 80 

Val Tyr Gly Asn Met Ala Leu His Tyr Ala Val Tyr Ser Glu He Leu 
85 90 95 

Ser Val Val Ala Lys Leu Leu Ser His Gly Ala Val He Glu Val His 
100 105 110 

Asn Lys Ala Ser Leu Thr Pro Leu Leu Leu Ser He Thr Lys Arg Ser 
115 120 125 

Glu Gin He Val Glu Phe Leu Leu He Lys Asn Ala Asn Ala Asn Ala 
130 135 140 

Val Asn Lys Tyr Lys Cys Thr Ala Leu Met Leu Ala Val Cys His Gly 
145 150 155 160 

Ser Ser Glu He Val Gly Met Leu Leu Gin Gin Asn Val Asp Val Phe 
165 170 175 

Ala Ala Asp He Cys Gly Val Thr Ala Glu His Tyr Ala Val Thr Cys 
180 185 190 

Gly Phe His His He His Glu Gin He Met Glu Tyr He Arg Lys Leu 
195 200 205 

Ser Lys Asn His Gin Asn Thr Asn Pro Glu Gly Thr Ser Ala Gly Thr 
210 215 220 

Pro Asp Glu Ala Ala Pro Leu Ala Glu Arg Thr Pro Asp Thr Ala Glu 
225 230 235 240 

Ser Leu Val Glu Lys Thr Pro Asp Glu Ala Ala Pro Leu Val Glu Arg 
245 250 255 

Thr Pro Asp Thr Ala Glu Ser Leu Val Glu Lys Thr Pro Asp Glu Ala 
260 265 270 

Ala Ser Leu Val Glu Gly Thr Ser Asp Lys He Gin Cys Leu Glu Lys 
275 280 285 

Ala Thr Ser Gly Lys Phe Glu Gin Ser Ala Glu Glu Thr Pro Arg Glu 
290 295 300 

He Thr Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp Pro Ala 
305 310 315 320 



Lys Gly Arg Pro Arg Lys lie Ala Trp Glu Lys Lys Glu Asp Thr Pro 
325 330 335 

Arg Glu He Met Ser Pro Ala Lys Glu Thr Ser Glu Lys Phe Thr Trp 
340 345 350 

Ala Ala Lys Gly Arg Pro Arg Lys He Ala Trp Glu Lys Lys Glu Thr 
355 360 365 

Pro Val Lys Thr Gly Cys Val Ala Arg Val Thr Ser Asn Lys Thr Lys 
370 375 380 

Val Leu Glu Lys Gly Arg Ser Lys Met He Ala Cys Pro Thr Lys Glu 
385 390 395 400 

Ser Ser Thr Lys Ala Ser Ala Asn Asp Gin Arg Phe Pro Ser Glu Ser 
405 410 415 

Lys Gin Glu Glu Asp Glu Glu Tyr Ser Cys Asp Ser Arg Ser Leu Phe 
420 425 430 

Glu Ser Ser Ala Lys He Gin Val Cys lie Pro Glu Ser He Tyr Gin 
435 440 445 

Lys Val Met Glu He Asn Arg Glu Val Glu Glu Pro Pro Lys Lys Pro 
450 455 460 

Ser Ala Phe Lys Pro Ala He Glu Met Gin Asn Ser Val Pro Asn Lys 
465 470 475 480 

Ala Phe Glu Leu Lys Asn Glu Gin Thr Leu Arg Ala Asp Pro Met Phe 
485 490 495 

Pro Pro Glu Ser Lys Gin Lys Asp Tyr Glu Glu Asn Ser Trp Asp Ser 
500 505 510 

Glu Ser Leu Cys Glu Thr Val Ser Gin Lys Asp Val Cys Leu Pro Lys 
515 520 525 

Ala Thr His Gin Lys Glu He Asp Lys He Asn Gly Lys Leu Glu Glu 
530 535 540 

Ser Pro Asn Lys Asp Gly Leu Leu Lys Ala Thr Cys Gly Met Lys Val 
545 550 555 560 

Ser He Pro Thr Lys Ala Leu Glu Leu Lys Asp Met Gin Thr Phe Lys 
565 570 575 

Ala Glu Pro Pro Gly Lys Pro Ser Ala Phe Glu Pro Ala Thr Glu Met 
580 585 590 

Gin Lys Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gin Thr 
595 600 605 

Trp Arg Ala Asp Glu He Leu Pro Ser Glu Ser Lys Gin Lys Asp Tyr 
610 615 620 

Glu Glu Asn Ser Trp Asp Thr Glu Ser Leu Cys Glu Thr Val Ser Gin 
625 630 635 640 

Lys Asp Val Cys Leu Pro Lys Ala Ala His Gin Lys Glu He Asp Lys 
645 650 655 

He Asn Gly Lys Leu Glu Gly Ser Pro Val Lys Asp Gly Leu Leu Lys 
660 665 670 

Ala Asn Cys Gly Met Lys Val Ser He Pro Thr Lys Ala Leu Glu Leu 



675 680 685 

Met Asp Met Gin Thr Phe Lys Ala Glu Pro Pro Glu Lys Pro Ser Ala 
690 695 700 

Phe Glu Pro Ala lie Glu Met Gin Lys Ser Val Pro Asn Lys Ala Leu 
705 710 715 720 

Glu Leu Lys Asn Glu Gin Thr Leu Arg Ala Asp Glu He Leu Pro Ser 
725 730 735 

Glu Ser Lys Gin Lys Asp Tyr Glu Glu Ser Ser Trp Asp Ser Glu Ser 
740 745 750 

Leu Cys Glu Thr Val Ser Gin Lys Asp Val Cys Leu Pro Lys Ala Thr 
755 760 765 

His Gin Lys Glu He Asp Lys He Asn Gly Lys Leu Glu Glu Ser Pro 
770 775 780 

Asp Asn Asp Gly Phe Leu Lys Ala Pro Cys Arg Met Lys Val Ser He 
785 790 795 800 

Pro Thr Lys Ala Leu Glu Leu Met Asp Met Gin Thr Phe Lys Ala Glu 
805 810 815 

Pro Pro Glu Lys Pro Ser Ala Phe Glu Pro Ala He Glu Met Gin Lys 
820 825 830 

Ser Val Pro Asn Lys Ala Leu Glu Leu Lys Asn Glu Gin Thr Leu Arg 
835 840 845 

Ala Asp Gin Met Phe Pro Ser Glu Ser Lys Gin Lys Lys Val Glu Glu 
850 855 860 

Asn Ser Trp Asp Ser Glu Ser Leu Arg Glu Thr Val Ser Gin Lys Asp 
865 870 875 880 

Val Cys Val Pro Lys Ala Thr His Gin Lys Glu Met Asp Lys He Ser 
885 890 895 

Gly Lys Leu Glu Asp Ser Thr Ser Leu Ser Lys He Leu Asp Thr Val 
900 905 910 

His Ser Cys Glu Arg Ala Arg Glu Leu Gin Lys Asp His Cys Glu Gin 
915 920 925 

Arg Thr Gly Lys Met Glu Gin Met Lys Lys Lys Phe Cys Val Leu Lys 
930 935 940 

Lys Lys Leu Ser Glu Ala Lys Glu He Lys Ser Gin Leu Glu Asn Gin 
945 950 955 960 

Lys Val Lys Trp Glu Gin Glu Leu Cys Ser Val Arg Leu Thr Leu Asn 
965 970 975 

Gin Glu Glu Glu Lys Arg Arg Asn Ala Asp He Leu Asn Glu Lys He 
980 985 990 

Arg Glu Glu Leu Gly Arg He Glu Glu Gin His Arg Lys Glu Leu Glu 
995 1000 1005 

Val Lys Gin Gin Leu Glu Gin Ala Leu Arg He Gin Asp He Glu Leu 
1010 1015 1020 

Lys Ser Val Glu Ser Asn Leu Asn Gin Val Ser His Thr His Glu Asn 
1025 1030 1035 1040 



Glu Asn Tyr Leu Leu His Glu Asn Cys Met Leu Lys Lys Glu He Ala 
1045 1050 1055 



Met Leu Lys Leu Glu He Ala Thr Leu Lys His Gin Tyr Gin Glu Lys 
1060 1065 1070 

Glu Asn Lys Tyr Phe Glu Asp He Lys He Leu Lys Glu Lys Asn Ala 
1075 1080 1085 

Glu Leu Gin Met Thr Leu Lys Leu Lys Glu Glu Ser Leu Thr Lys Arg 
1090 1095 1100 

Ala Ser Gin Tyr Ser Gly Gin Leu Lys Val Leu He Ala Glu Asn Thr 
1105 1110 1115 1120 

Met Leu Thr Ser Lys Leu Lys Glu Lys Gin Asp Lys Glu He Leu Glu 
1125 1130 1135 

Ala Glu lie Glu Ser His His Pro Arg Leu Ala Ser Ala Val Gin Asp 
1140 1145 1150 

His Asp Gin He Val Thr Ser Arg Lys Ser Gin Glu Pro Ala Phe His 
1155 1160 1165 

lie Ala Gly Asp Ala Cys Leu Gin Arg Lys Met Asn Val Asp Val Ser 
1170 1175 1180 

Ser Thr lie Tyr Asn Asn Glu Val Leu His Gin Pro Leu Ser Glu Ala 
1185 1190 1195 1200 

Gin Arg Lys Ser Lys Ser Leu Lys lie Asn Leu Asn Tyr Ala Gly Asp 
1205 1210 1215 

Ala Leu Arg Glu Asn Thr Leu Val Ser Glu His Ala Gin Arg Asp Gin 
1220 1225 1230 

Arg Glu Thr Gin Cys Gin Met Lys Glu Ala Glu His Met Tyr Gin Asn 
1235 1240 1245 

Glu Gin Asp Asn Val Asn Lys His Thr Glu Gin Gin Glu Ser Leu Asp 
1250 1255 1260 

Gin Lys Leu Phe Gin Leu Gin Ser Lys Asn Met Trp Leu Gin Gin Gin 
1265 1270 1275 1280 

Leu Val His Ala His Lys Lys Ala Asp Asn Lys Ser Lys lie Thr lie 
1285 1290 1295 

Asp lie His Phe Leu Glu Arg Lys Met Gin His His Leu Leu Lys Glu 
1300 1305 1310 

Lys Asn Glu Glu lie Phe Asn Tyr Asn Asn His Leu Lys Asn Arg lie 
1315 1320 1325 

Tyr Gin Tyr Glu Lys Glu Lys Ala Glu Thr Glu Asn Ser 
1330 1335 1340 



<210> 23 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 23 

aatgggaa4a agagctctgc ag 22 
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<211> 3673 












<213> Homo 
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<220> 






<221> CDS 






<999> 4^9 


47^ 1 789 




<400> 26 






uday cty u l Ly 


y uy cl l. ct^ciy a 




or or or o< of ^ or -f— rt- 

yyycyay Lyo 


gagocyyyyy 


uy yy Ly u uyy 


gtaggggctg 


gggaagggcy 


d.y cyyydyyo 


atgaagaggc 


tcttagctgc 


cyuLyyuddy 


ttcagcgaac 


gggtctacac 


4* or or -n /y or --^ o* 

Ly agaaggac 


aagatccata 


cagctgcctc 


ccggggccaa 


aagaagcccg 


tcaacctgaa 


caaaayayau 


tgtgtcaatg 


gccatgcana 


ay Lay Laata 


gtccttgatg 


gcgaagggag 


/-t o* r~~* /■*■ ~\~ {"* r*f 

yaudLL Lt ty 


ttgtgcaaat 


att ctcatag 


angc uggi-gc 


cacggctctc 


cattatgccg 


tttatagtga 


ctatggtgca 


gtcatcgagg 


tgcaaaacaa 


acagaaaaga 


agcaagcaaa 


ctgtggaatt 


atttaatgag 


tctaaatgca 


cagccctcat 


agtcggcatg 


cttcttcagc 


aaaatgttga 


tgcagaacgt 


tatgctgctg 


ctcgtggagt 


tatacgaaaa 


ttacctaaaa 


atcctcaaaa 


acctgatgag 


gctgcaccct 


tggcggaaag 


aaaaacacct 


gacgaggctg 


cacgcttggt 


ggggaaagca 


acatctggaa 


agtttgaaca 


gaggcctaca 


aaagaaacat 


ctgagaaatt 


gatcacatgg 


gaggaaaaag 


aaacatctgt 


taataaaact 


gaagttttgg 


aaaaaggaac 


aacatctaca 


aaagcaagta 


caaatgtgga 


ttttggcaca 


cggactattg 


aaaattcaca 


tgctaccaag 


attatctcta 


agagtgctgc 



ggtgttgggg 


cgggtgcggg 


aactgaagac 




ggaagggtaa 


gcgggaagcg 


agggcgaggg 


i on 
±<£ U 


gcgggctctc 


tctagcaggg 


ggctgcagcc 


1 OA 

loU 


ggcgtgcggg 


gcccggagcc 


cccgaacccc 


240 


tacgggacca 


tctacttcgg 


ggatctaggg 


300 


gtccagaagc 


tggagaagat 


gacagtaggg 


360 


atgaagaaga 


ggactgctct 


acactgggcc 


420 


tttctggtag 


acagaaagtg 


ccngcttaat 


480 


atgaaggctc 


tacaatgcga 


gagggaagct 


540 


tgatctaaat 


tatgtagatg 


tgtatggcaa 


600 


gaatttatta 


atggtggcaa 


cactgctgtc 


660 


ggctagcctc 


acaccccttt 


tactggccat 


720 


tttactaaca 
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atgcaaacgc 


780 


gcttgccata 


tgtgaaggct 


catcagagat 


840 


cgtctttgct 


gaagacatac 


atggaataac 


900 


taattacatt 


catcaacaac 


ttttggaaca 


960 


taccaatcca 


gaaggaacat 


ctacaggaac 


1020 


aacacctgac 


acggctgaaa 


gcttgctgga 


1080 


ggagggaacg 


tctgccaaaa 


ttcaatgtct 


1140 


gtcaacagaa 


gaaacaccta 


ggaaaatttt 


1200 


ttcatggcca 


gcaaaagaaa 


gatctaggaa 


1260 


aaagactgaa 


tgcgtggcag 


gagtaacacc 


1320 


atctaatatg 


attgcatgtc 


ctacaaaaga 


1380 


tgtgagttct 


gtagagccta 


tattcagtct 


1440 


gtgtacaaaa 


gttgaggaag 


actttaatct 
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acagaattat 


acgtgtttac 
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1800 


gactgttcca 
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ttgaattgaa 
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acattgagag 
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gaatccaaac 
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ggcatctcag tatagagagc agcttaaagt tctgacggca gagaacacga tgctgacttc 3660 
taaattgaag gaa 3673 



<210> 27 
<211> 1011 
<212> PRT 

<213> Homo sapiens 

<220> 

<400> 27 

Met Val Ala Thr Leu Leu Ser Tyr Gly Ala Val He Glu Val Gin Asn 
15 10 15 

Lys Ala Ser Leu Thr Pro Leu Leu Leu Ala He Gin Lys Arg Ser Lys 
20 25 30 

Gin Thr Val Glu Phe Leu Leu Thr Lys Asn Ala Asn Ala Asn Ala Phe 
35 40 45 

Asn Glu Ser Lys Cys Thr Ala Leu Met Leu Ala He Cys Glu Gly Ser 
50 55 60 

Ser Glu He Val Gly Met Leu Leu Gin Gin Asn Val Asp Val Phe Ala 
65 70 75 80 

Glu Asp He His Gly He Thr Ala Glu Arg Tyr Ala Ala Ala Arg Gly 
85 90 95 

Val Asn Tyr He His Gin Gin Leu Leu Glu His He Arg Lys Leu Pro 
100 105 110 

Lys Asn Pro Gin Asn Thr Asn Pro Glu Gly Thr Ser Thr Gly Thr Pro 
115 120 125 

Asp Glu Ala Ala Pro Leu Ala Glu Arg Thr Pro Asp Thr Ala Glu Ser 
130 135 140 

Leu Leu Glu Lys Thr Pro Asp Glu Ala Ala Arg Leu Val Glu Gly Thr 
145 150 155 160 

Ser Ala Lys He Gin Cys Leu Gly Lys Ala Thr Ser Gly Lys Phe Glu 
165 170 175 

Gin Ser Thr Glu Glu Thr Pro Arg Lys He Leu Arg Pro Thr Lys Glu 
180 185 190 

Thr Ser Glu Lys Phe Ser Trp Pro Ala Lys Glu Arg Ser Arg Lys He 
195 200 205 

Thr Trp Glu Glu Lys Glu Thr Ser Val Lys Thr Glu Cys Val Ala Gly 
210 215 220 

Val Thr Pro Asn Lys Thr Glu Val Leu Glu Lys Gly Thr Ser Asn Met 
225 230 235 240 

He Ala Cys Pro Thr Lys Glu Thr Ser Thr Lys Ala Ser Thr Asn Val 
245 250 255 

Asp Val Ser Ser Val Glu Pro He Phe Ser Leu Phe Gly Thr Arg Thr 
260 265 270 

He Glu Asn Ser Gin Cys Thr Lys Val Glu Glu Asp Phe Asn Leu Ala 
275 280 285 

Thr Lys He He Ser Lys Ser Ala Ala Gin Asn Tyr Thr Cys Leu Pro 
290 295 300 



Asp Ala Thr Tyr Gin Lys Asp He Lys Thr He Asn His Lys He Glu 
305 310 315 320 



Asp Gin Met Phe Pro Ser Glu Ser Lys Arg Glu Glu Asp Glu Glu Tyr 
325 330 335 

Ser Trp Asp Ser Gly Ser Leu Phe Glu Ser Ser Ala Lys Thr Gin Val 
340 345 350 

Cys He Pro Glu Ser Met Tyr Gin Lys Val Met Glu He Asn Arg Glu 
355 360 365 

Val Glu Glu Leu Pro Glu Lys Pro Ser Ala Phe Lys Pro Ala Val Glu 
370 375 380 

Met Gin Lys Thr Val Pro Asn Lys Ala Phe Glu Leu Lys Asn Glu Gin 
385 390 395 400 

Thr Leu Arg Ala Ala Gin Met Phe Pro Ser Glu Ser Lys Gin Lys Asp 
405 410 415 

Asp Glu Glu Asn Ser Trp Asp Ser Glu Ser Pro Cys Glu Thr Val Ser 
420 425 430 

Gin Lys Asp Val Tyr Leu Pro Lys Ala Thr His Gin Lys Glu Phe Asp 
435 440 445 

Thr Leu Ser Gly Lys Leu Glu Glu Ser Pro Val Lys Asp Gly Leu Leu 
450 455 460 

Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn Lys Ala Leu Glu 
465 470 475 480 

Leu Lys Asp Arg Glu Thr Phe Lys Ala Glu Ser Pro Asp Lys Asp Gly 
485 490 495 

Leu Leu Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn Lys Ala 
500 505 510 

Leu Glu Leu Lys Asp Arg Glu Thr Leu Lys Ala Glu Ser Pro Asp Asn 
515 520 525 

Asp Gly Leu Leu Lys Pro Thr Cys Gly Arg Lys Val Ser Leu Pro Asn 
530 535 540 

Lys Ala Leu Glu Leu Lys Asp Arg Glu Thr Phe Lys Ala Ala Gin Met 
545 550 555 560 

Phe Pro Ser Glu Ser Lys Gin Lys Asp Asp Glu Glu Asn Ser Trp Asp 
565 570 575 

Phe Glu Ser Phe Leu Glu Thr Leu Leu Gin Asn Asp Val Cys Leu Pro 
580 585 590 

Lys Ala Thr His Gin Lys Glu Phe Asp Thr Leu Ser Gly Lys Leu Glu 
595 600 605 

Glu Ser Pro Asp Lys Asp Gly Leu Leu Lys Pro Thr Cys Gly Met Lys 
610 615 620 

He Ser Leu Pro Asn Lys Ala Leu Glu Leu Lys Asp Arg Glu Thr Phe 
625 630 635 640 

Lys Ala Glu Asp Val Ser Ser Val Glu Ser Thr Phe Ser Leu Phe Gly 
645 650 655 

Lys Pro Thr Thr Glu Asn Ser Gin Ser Thr Lys Val Glu Glu Asp Phe 



660 665 670 

Asn Leu Thr Thr Lys Glu Gly Ala Thr Lys Thr Val Thr Gly Gin Gin 
675 680 685 

Glu Arg Asp He Gly He He Glu Arg Ala Pro Gin Asp Gin Thr Asn 
690 695 700 

Lys Met Pro Thr Ser Glu Leu Gly Arg Lys Glu Asp Thr Lys Ser Thr 
705 710 715 720 

Ser Asp Ser Glu He He Ser Val Ser Asp Thr Gin Asn Tyr Glu Cys 
725 730 735 

Leu Pro Glu Ala Thr Tyr Gin Lys Glu He Lys Thr Thr Asn Gly Lys 
740 745 750 

lie Glu Glu Ser Pro Glu Lys Pro Ser His Phe Glu Pro Ala Thr Glu 
755 760 765 

Met Gin Asn Ser Val Pro Asn Lys Gly Leu Glu Trp Lys Asn Lys Gin 
770 775 780 

Thr Leu Arg Ala Asp Ser Thr Thr Leu Ser Lys He Leu Asp Ala Leu 
785 790 795 800 

Pro Ser Cys Glu Arg Gly Arg Glu Leu Lys Lys Asp Asn Cys Glu Gin 
805 810 815 

He Thr Ala Lys Met Glu Gin Met Lys Asn Lys Phe Cys Val Leu Gin 
820 825 830 

Lys Glu Leu Ser Glu Ala Lys Glu He Lys Ser Gin Leu Glu Asn Gin 
835 840 845 

Lys Ala Lys Trp Glu Gin Glu Leu Cys Ser Val Arg Leu Pro Leu Asn 
850 855 860 

Gin Glu Glu Glu Lys Arg Arg Asn Val Asp He Leu Lys Glu Lys He 
865 870 875 880 

Arg Pro Glu Glu Gin Leu Arg Lys Lys Leu Glu Val Lys His Gin Leu 
885 890 895 

Glu Gin Thr Leu Arg He Gin Asp He Glu Leu Lys Ser Val Thr Ser 
900 905 910 

Asn Leu Asn Gin Val Ser His Thr His Glu Ser Glu Asn Asp Leu Phe 
915 920 925 

His Glu Asn Cys Met Leu Lys Lys Glu He Ala Met Leu Lys Leu Glu 
930 935 940 

Val Ala Thr Leu Lys His Gin His Gin Val Lys Glu Asn Lys Tyr Phe 
945 950 955 960 

Glu Asp He Lys He Leu Gin Glu Lys Asn Ala Glu Leu Gin Met Thr 
965 970 975 

Leu Lys Leu Lys Gin Lys Thr Val Thr Lys Arg Ala Ser Gin Tyr Arg 
980 985 990 

Glu Gin Leu Lys Val Leu Thr Ala Glu Asn Thr Met Leu Thr Ser Lys 
995 1000 1005 



Leu Lys Glu 
1010 



<210> 28 

<211> 23 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 28 

tctcatagat gctggtgctg ate 23 



<210> 29 

<211> 24 

<212> DNA 

<213> Homo sapiens 

<220> 

<400> 29 

cccagacatt gaattttggc agac 24 
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